^flMWlPtO 0 8 FEB tm 



B&P File No. 9962-58 

BERESKIN & PARR PCT 



Title: LAFORA'S DISEASE GENE 

Inventors : STEPHEN W. SCHERER and BERGE A. MINASSIAN 



lU/5670/4 

-1 - 

TITLE : LAFORA'S DISEASE GENE 
FIELD OF THE INVENTION 

The invention relates to a novel gene. EPM2B, that is involved in 
Lafora's disease; the protein, malin, encoded by the gene; and methods of 
5 diagnosing and treating Lafora's disease. 
BACKGROUND OF THE INVENTION 

Lafora*s disease (LD, OMIM 254780) is the most common and severe 
form of adolescent-onset progressive epilepsy. Increasing seizures are 
paralleled with an insidious cognitive decline towards dementia, and death 

10 usually within 10 years of onset (1 ,2). At the cellular level, LD is characterized 
by an endoplasmic reticulum (ER)"associated accumulation (3) of starch-like 
glucose polymers (4) called polyglucosans (or Lafora bodies). Inheritance is 
autosomal recessive with genetic heterogeneity but the clinical presentation is 
homogeneous (5), The inventors previously discovered that mutations in the 

15 EPM2A gene on chromosome 6q24 encoding a dual-specificity phosphatase 
(named Laforin) with a carbohydrate brnding domain, cause LD (6, 7 and WO 
00/05405). 

There is a need in the art to identify other genes involved in Lafora's 
disease to assist in the diagnosis and treatment of Lafora^s disease. 

20 SUMMARY OF THE INVENTION 

The present inventors positionally cloned a novel gene, EP/W2S, on 
chromosome 6p22.3. It encodes a protein with a putative RING-finger domain 
and 6 NHL-motifs, which are features of complexes designed for ubiquitin- 
mediated regulation of specific substrates (8,9) and protein-protein 

25 interactions (10-13). respectively. Twenty-one distinct DMA sequence 
variations in EPM2B predicted to cause deleterious effects on the protein 
product, named malin. were found to co-segregate with LD in 39 families. 
Both laforin and malin localize to the ER suggesting they operate in a related 
pathway protecting against neuronal polyglucosan accumulation and epilepsy. 

30 The inventors have also isolated and sequenced the canine version of 
EPM2B and have shown a mutation in EPM2B in dogs with LD. 
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Accordingly, the present invention provides an isolated nucleic acid 
molecule that is associated with Lafora's disease and having the sequence 
shown in SEQ ID NO:1 (Figure 6A) (human EPM2B). The present invention 
also provides an isolated nucleic acid molecule that is associated with 
5 Lafora's disease and having a sequence shown in SEQ ID N0:3 (Figure 7A) 
(canine EP/W2S). 

Preferably, the purified and isolated nucleic aicid molecule comprises: 
(a) a nucleic acid sequence as shown in SEQ ID N0:1 (Figure 6A) 
and SEQ ID N0:3 (Figure 7A). wherein T can also be U; 
10 (b) . a nucleic acid sequence complementary to (a); 

(c) . a nucleic acid sequence that has substantial sequence 
homology to a nucleic acid sequence of (a) or (b); 

(d) a nucleic acid sequence that . is an analog of a nucleic acid 
sequence of (a), (b) or (c); or 

15 (e) a nucleic acid sequence that hybridizes to a nucleic acid 

sequence of (a), (b). (c) or (d) under stringent hybridization conditions. 

*. The present invention also includes an isolated protein containing a 
zinc finger of the RING type and 6 NHL-repeat domains which is associated 
with Lafora's disease. In a preferred embodiment of the invention, the protein 

20 has the amino acid sequence as shown in SEQ ID NO:2 (Figure 6B) (human 
EPM2B). In another embodiment, the protein has the amino acid sequence 
shown in SEQ ID NO:4 (Figure 7B) (canine EPM2B). 

As shown in Table 1. the inventors have found 21 different mutations in 
the human EPM2B gene that are associated with Lafora's disease. 

25 Accordingly, the present invention provides a method of detecting Lafora*s 
disease comprising detecting a mutation in the EPM2B gene in a sample from 
a mammal, preferably human. In a preferred embodiment, the mutation is 
one listed in Table 1 . 

The inventors have also discovered a mutation in the EPM2B gene in 

30 dogs with Lafora's disease. In particular, all affected dogs studied had a bi- 
alielic expansion of a dodecamer repeat, termed D. and having the sequence 
GCCGCCCCCCGC that starts at nucleotide number 1001 of canine EPM2B 
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sequence shown in SEQ ID NO:3. Accordingly, the invention further provides 
a method of detecting Lafora's disease in a canid comprising detecting a 
repeat of the sequence GCCGCCCCCCGC (SEQ ID N0:5) which starts at 
nucleotide number 1001 in the canine sequence of EPM2B (SEQ ID N0:3). 
5 Other features and advantages of the present invention will become 

apparent from the following detailed description. It should be understood, 
however, that the detailed description and the specific examples while 
indicating preferred embodiments of the invention are given by way of 
illustration only, since various changes and modifications within the spirit and 
10 scope of the invention will become apparent to those skilled in the art from 
this detailed description. 
BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the drawings in 

which: 

15 Figure 1. The EPM2B region on 6p22.3. The previous 2.2 Mb critical 

interval was delimited by microsatellite markers D6S1567 and BV012568 (1 5); 
The BV012568 boundary was based on loss of homozygosity in LD 
individuals from an F-C family (LD6; see Fig. 2). The D6S1567 telomeric 
boundary was defined by a recombination occurring between BV012563 and 

20 D6S1567 in an unaffected sibling of a second consanguineous F-C family (15) 
(LD27). A break in the chain of homozygosity of markers in the LD38324 
family allowed the centromeric boundary to be further re-defined to D6S1688 
(Figure 2) The region contains 7 previously annotated genes and the newly 
discovered EPM2B (Figure 6A), which comprised of a single 1188bp coding 

25 exon. Representative human and mouse cDNA sequences are shown, as are 
the putative ATG start and AATAAA polyadenylation signals. The ATG start 
follows an in-frame stop (at position -60) and the corresponding AUG is 
present at the beginning of the predicted ORF. The nucleotide sequence 
surrounding the start (CGCGCCAUGG) has the proposed features of the 

30 consensus sequence (GCCA/GCCAUGG) of an eukaryotic translation 
initiation site (29). EPM2B is predicted to encode a 42.3 kDa (395 aa) protein 
(malin) containing detectable zinc-binding RING-finger and 6 NHL-repeat 
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domains (FIGURE 6B). The RING and NHL acronyms arise from descriptions 
of the first proteins identified to carry them, namely, the Really Interesting 
New Gene 1 in Homo sapiens (30) and the NCL-1 (1 1)/HT2A (10)/LIN-41 (13) 
genes, respectively. It should be noted that RING- and/or NHL- domains 
5 occur in a variety of proteins which can have one or both of cytoplasmic or 
nuclear localization (8,9,12,18). Malin is the only protein so far described 
haying RING and NHL motifs only (there are other proteins with this 
combination but they also have other associated motifs such as RiNG-B^Box- 
coiled-coil domains). The site of a common C332T (P111L) polymorphism is 

10 shown by an askerisk (*) (see Table 1). Malin shares 79%, 80%, and 85% 
homology with the predicted rat (419 aa), mouse (401 aa) and dog (402 aa) 
proteins, respectively, the variable amino acids were primarily located in the 
carboxy- and amino- ends of the protein and not in the RING finger or NHL 
domains. The microsatellite markers beginning with BV- were generated in 

15 this study. 

Figure 2. Refinement of the EPM2B critical interval by haplotype 
analysis in LD families and mutations in the EPM2B gene, a, The centromeric 
boundary was narrowed to D6S1688 refining the critical region to 840 kb 
based on the loss of homozygosity in both probands in family LD38324. b, 
20 Sequence analysis of EPM2B identifies a homozygous 76T>A change in 
family LD6 (as well as in the LD7, LD27, and LD28 F-C families, Table 1). 
Affected individuals in family LD38324 were found to be homozygous for a 
dinucleotide deletion (1 048-1 049delGA) leading to frame-shift mutation in the 
fifth NHL domain. 

25 Figure 3. RNA hybridization expression analysis of EPM2B in human 

tissues, a, A multiple tissue blot (Clontech) was hybridized with a 557bp 
fragment of the coding region of EPM2B. Two transcripts 2.4 kb and 1.5 kb in 
size were identified in all tissues, b, The same sized transcripts were found in 
tissues from all regions of the brain tested. 

30 Figure 4. Electron micrograph of brain biopsy material from patient 

LD32817 {EPM2B mutation 98T>C). A, axon (note the numerous normal 
neurotransmitter vesicles); S, synapse; LB, Lafora body (large rounded 
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. structure) composed of a dense accumulation of polyglucosan filaments (PG) 
completely occupying the dendrite. Bar equals 500nm. 

Figure 5. Cellular localization of the malin and laforin proteins, a, Myc- 
tagged malin (construct pcDNA3myc£PM2B) forms a distinct reticular pattern 
5 . around the nucleus, as well as within the nucleus. Co-staining with antibody 
against the GRP94 endoplasmic reticulum (ER)-specific marker reveals co- 
localization with malin. b, Co-localization of the cytoplasmic isoform of laforin 
(construct pcDNA3myc£PM2>A (24)) with the ER-specific marker GRP94. 

Figures 6A and B (SEQ ID NOS:1 and 2) provides the human nucleic 

10 acid and amino acid sequence of EPM2B. 

Figures 7A and B (SEQ ID NOS:3 and 4) provides the canine nucleic 
acid and amino acid sequence for EPM2B, 

Figure 8 shows the profuse starch-like polyglucosan accumulations 
(Lafora Bodies; LB) form in tissues of affected MWHD with characteristics and 

15 cellular distribution patterns identical to human LD (1,29,30). Affected MWHD 
tissues included nine muscle and two liver biopsies, as well as skin samples 
and four whole brains obtained at necropsy. Control tissues did not exhibit LB 
(seven age-matched animals including two normal MWHD and three non-LD 
epileptic dogs) and are not shown. Histochemical slides were stained with 

20 periodic acid-Schiff following diastase treatment, which allows specific 
detection of polyglucosans (1). Innumerable LB were present throughout the . 
brain in affected MWHD. located, as in human LD (1), in neurons and not glia 
and in neuronal perikarya and dendrites and not axons: a, LB (arrows) in the 
molecular layer of cerebral cortex (XI 00 magnification); b, Example of a 

25 dendritic LB (asterisk); multiple axons (A) synapsing with this dendrite do not 
contain LB (bar=500nm). c, In liver, LB (arrows) were only In gluconeogenic 
(periportal) hepatocytes, as in human (29); (PV, portal vein; D, portal ductule; 
X300). d, e, In skin, LB (arrows) were in sweat glands, specifically, as in 
human (30), in myoepithelial cells surrounding apocrine sweat glands (d, 

30 X100) and duct cells of merocrine sweat glands (e, foot pad skin tissue, 
X250). f. In skeletal muscle, LB were within vacuoles surrounded by 
membranes (arrowheads), again as in human LD (29) (bar=500nm). 
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Figure 9 shows MWHD LD links to the EPM2B gene, which contains an 
expansion nnutation in affected dogs, a, Pedigree and haplotypes; numbers in 
black are genotypes, in descending order, at chromosome CFA35 
microsatellites REN256I01. REN282I22, D02001, REN157J09, REN214H22. 
5 REN94K23, REN166C14, REN172L08; in red, the position of the EPM2B 
gene and the number of D repeats in its coding polymorphic dodecamer 
repeat; note: expansion mutations are meiotically unstable (e.g. see 
transmission of expansion from dog 29 to dog 32); E. expanded alleles with 
precise repeat number not determined; boxes, genotype homozygosities 

10 flanking the mutation, b, Multipoint LOD score analysis. 

Figure 10 shows the evolution of the EPM2B dodecamer polymorphism 
with the Canidae family of carnivores, presence of an expansion mutation in a 
non-MWHD epileptic dog, and the inactivating effect of expansions on EPM2B 
mRNA. a. Canine EPM2S dodecamer repeat sequence and orthologues; D, 

15 exact repeats; T, imperfect repeat (asterisk); red nucleotides, differences with 
the canine sequence in the repeat region, b, EPM2B expansion mutation 
associated with myoclonic epilepsy in dogs; W, wild-type; M, affected MWHD; 
B, affected Basset Hound; C, several unaffected carrier MWHD; arrowheads, 
normal alleles; all other bands, expanded mutant alleles; note how 

20 deamination greatly improves PGR detection of expanded alleles even in the 
presence of the normal sequence, c, EPM2B amounts (normalized against 
Gapdh) in skeletal muscle from affected MWHD and controls using real-time 
quantitative RT-PCR (SYBR Green detection; Supplementary Methods), d. 
Evolution of the dodecamer polymorphism; the D sequence is not present in 

25 feline species; it is present in all canoids: as a single copy in arctoids. and in 
polymorphic state with one or two copies in canids; both alleles are shown in 
heterozygous individuals (two sequence rows); all extant carnivores shown 
appeared sometime after 10 million years ago (Ma). 
DETAILED DESCRIPTION OF THE INVENTION 

30 Previous studies by the inventors suggested that -70% of LD patients 

carry recessive mutations in the EPM2A gene on chromosome 6q24 (6,7,14). 
To identify the causative gene(s) in the remaining patients, the inventors 



initially performed linkage and homozygosity mapping on a subset of the non- 
EPM2A ID families (LD6, LD7, LD27, LD28) originating from a French- 
Canadian (F-C) isolate (15) (Table 1). This approach led to the localization of 
a second LD locus {EPM2B) to a 2.2 Mb region on chromosome 6p22.3 
5 (Figure 1). All affected individuals from these F-C families were found to be 
homozygous for a rare haplotype across the entire critical interval (Figure 2a). 
To further refine the locus, five additional LD families having multiply affected 
siblings were examined using every microsatellite marker that could be 
developed from the DNA sequence encompassing the critical region. In four 

10 families, all affected individuals were homozygous for all markers across the 
critical interval. In one family (LD38324), however, the chain of homozygosity 
in two affected individuals extended only partially into the critical region 
allowing reduction of the EPM2B locus to 840 kb between D6S1688 and 
D6Sy567 (Figures 1 and 2a). 

15 L NUCLEIC ACID MOLECULES OF THE INVENTION 

As hereinbefore mentioned, the present invention relates to isolated 
nucleic acid molecules that are involved in Lafora's disease. The term 
"isolated" refers to a nucleic acid substantially free of cellular material or 
culture medium when produced by recombinant DNA techniques, or chemical 

20 precursors, or other chemicals when chemically synthesized. The term 
"nucleic acid" is intended to include DNA and RNA and can be either double 
stranded or single stranded. 

Broadly stated, the present invention provides an isolated nucleic acid 
molecule encoding a protein containing a zinc finger of the RING type in the 

25 N-terminal portion and 6 NHL-repeat domains in the C-terminal portion which 
is associated with Lafora's disease. The isolated nucleic acid molecule is 
preferably the EPM2B gene associated with Lafora's disease. In an 
embodiment of the invention, the isolated nucleic acid molecule has a 
sequence as shown in SEQ ID NO:1 (Figure 6A) or SEQ ID N0:3 (Figure 7A). 

30 The nucleic acid sequences shown in SEQ ID NOS:1 and 3 (or Figures 

6A and 7A, respectively) as well as the mutated sequences specified in Table 
1 or in Example 3 can be collectively referred to herein as "the nucleic acid 



molecules of the invention". The amino acid sequences shown in SEQ ID 
NOS:2 and 4 (or Figures 6B and 7B, respectively) as well as the mutated 
sequences specified in Table 1 can be collectively referred to herein as the 
"proteins of the invention". 
5 Preferably, the purified and isolated nucleic acid molecule comprises 

(a) a nucleic acid sequence as shown in SEQ ID N0:1 (Figure 6A) 
or SEQ ID N0:3 (Figure 7A), wherein T can also be U; 

(b) a nucleic acid sequence complementary to (a); 

(c) a nucleic acid sequence that has substantial sequence 
10 homology to a nucleic acid sequence of (a) or (b); 

(d) a nucleic acid sequence that is an analog of a nucleic acid 
sequence of (a), (b) or (c); or 

(e) a nucleic acid sequence that hybridizes to a nucleic acid 
sequence of (a), (b), (c) or (d) under stringent hybridization conditions. 

15 The term "sequence that has substantial sequence homology" means 

those nucleic acid sequences which have slight or inconsequential sequence 
variations from the sequences in (a) or (b), i.e., the sequences function in 
substantially the same manner and can be used to detect, study or treat 
Lafora's disease. The variations may be attributable to local mutations or 

20 structural modifications. Nucleic acid sequences having substantial homology 
include nucleic acid sequences having at least 65%. more preferably at least 
85%, and most preferably 90-95% identity with the nucleic acid sequences as 
shown in SEQ ID N0:1 or SEQ ID NO:3. 

"Sequence identity" can be calculated according to methods known in 

25 the art. Sequence identity is most preferably assessed by the algorithm of 
BLAST version 2.1 advanced search. BLAST is a series of programs that are 
available online at http://www.ncbi,nlm, nih.gov/BLAST. The advanced blast 
search (http://www.ncbi. nim. nih.gov/blast/blast. cgi?Jform=1) is set to default 
parameters, (ie Matrix BLOSUM62; Gap existence cost 11; Per residue gap 

30 cost 1; Lambda ratio 0.85 default). References to BLAST searches are: 
Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J, (1990) "Basic 
local alignment search tool." J. Mol. Biol. 215:403410; Gish, W. & States. DJ. 
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(1993) "Identification of protein coding regions by database similarity search." 
Nature Genet. 3:266272; Madden, T.L, Tatusov. R.L & Zhang. J, (1996) 
"Applications of network BLAST server" Meth. Enzymol, 266:131_141; 
Altschul. S.F.. Madden. T.L., Schaffer, A.A.. Zhang. J., Zhang, Z., Miller, W. & 
5 Lipman, D.J. (1997) "Gapped BLAST and PSLBLAST: a new generation of 
protein database search programs." Nucleic Acids Res. 25:33893402; Zhang, 
J. & Madden, t.L (1997) "PowerBLAST: A new network BLAST application 
for interactive or automated sequence analysis and annotation," Genome 
Res. 7:649656. 

10 The term "sequence that hybridizes" means a nucleic acid sequence 

that can hybridize to a sequence of (a), (b), (c) or (d) under stringent 
hybridization conditions. Appropriate "stringent hybridization conditions" 
which prohiote DNA hybridization are known to those skilled in the art, or may 
be found in Current Protocols in Molecular Biology, John Wiley & Sons. N.Y. 

15 (1989). 6.3.1-6,3.6. The term "stringent hybridization conditions" as used 
herein means that conditions are selected which promote selective 
hybridization between two complementary nucleic acid molecules in solution. 
Hybridization may occur to all or a portion of a nucleic acid sequence 
molecule. The hybridizing portion is at least 50% the length with respect to 

20 one of the polynucleotide sequences encoding a polypeptide. In this regard, 
the stability of a nucleic acid duplex, or hybrids, is determined by the Tm, 
which in sodium containing buffers is a function of the sodium ion 
concentration, G/C content of labeled nucleic acid, length of nucleic acid 
probe (I), and temperature (Tm = 81,5X - 16.6 (LoglO [Na+]) + 0.41(%(G+C) 

25 - 600/1). Accordingly, the parameteris in the wash conditions that determine 
hybrid stability are sodium ion concentration and temperature. In order to 
identify molecules that are similar, but not identical, to a known nucleic acid 
molecule a 1% mismatch may be assumed to result in about a I'^C decrease 
in Tm, for example if nucleic acid molecules are sought that have a greater 

30 than 95% identity, the final wash will be reduced by 5X. Based on these 
considerations stringent hybridization conditions can be defined as: 
hybridization at 5 x sodium chloride/sodium citrate (SSC)/5 x Denhardt's 
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solution/1.0% SDS at Tm (based on the above equation) -5**C, followed by a 
wash of 0.2 X SSC/0.1% SDS at 60'*C. 

The term "a nucleic acid sequence which is an analog" means a 
nucleic acid sequence which has been modified as compared to the sequence 
5 of (a), (b) or (c) wherein the modification does not alter the utility of the 
sequence as described herein. The modified sequence or analog may have 
improved properties over the sequence shown in (a), (b) or (c). One example 
of a modification to prepare an analog is to replace one of the naturally 
occurring bases (i.e. adenine, guanine, cytosine or thymidine) of the 

10 sequence shown in SEQ ID NO:1 or 3. with a modified base such as such as 
xanthine, hypoxanthine, 2-aminoadenine, 6-methyl, 2-propyl and other alkyi 
adenines, 5-halo uracil, 5-halo cytosine, S-aza uracil, 6-aza cytosine and 6- 
aza thymine, pseudo uracil, 4-thiouracil, 8-halo adenine, 8-aminoadenine, 8- 
thiol adenine, 8-thiolalkyl adenines, 8-hydroxyl adenine and other 8- 

15 substituted adenines, 8-halo guanines, 8 amino guanine, 8-thiol guanine, 8- 
thiolalkyl guanines, 8-hydroxyl guanine and other 8-substituted guanines, 
other aza and deaza uracils, thymidines, cytosines, adenines, or guanines, 5- 
trifluoromethyl uracil and 5-trifluoro cytosine. 

Another example of a modification is to include modified phosphorous 

20 or oxygen heteroatoms in the phosphate backbone, short chain alkyI or 
cycloalkyi intersugar linkages or short chain heteroatomic or heterocyclic 
intersugar linkages in the nucleic acid molecule shown in SEQ ID NO:1 or 
SEQ ID NO:3. For example, the nucleic acid sequences may contain 
phosphorothioates, phosphotriesters, methyl phosphonates, and 

25 phosphorodithioates. 

A further example of an analog of a nucleic acid molecule of the 
invention is a peptide nucleic acid (PNA) wherein the deoxyribose (or ribose) 
phosphate backbone in the DNA (or RNA), is replaced with a polyamlde 
backbone which is similar to that found in peptides (P.E. Nielsen, et al 

30 Science 1991, 254, 1497). PNA analogs have been shown to be resistant to 
degradation by enzymes and to have extended lives in vivo and in vitro. 
PNAs also bind stronger to a complimentary DNA sequence due to the lack of 
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charge repulsion between the PNA strand and the DNA strand. Other nucleic 
acid analogs may contain nucleotides containing polymer backbones, cyclic 
backbones, or acyclic backbones. For example, the nucleotides may have 
morpholino backbone structures (U.S. Pat. No. 5.034,506). The analogs may 
5 also contain groups such as reporter groups, a group for improving the 
pharmacokinetic or pharmacodynamic properties of nucleic acid sequence. 

It will be appreciated that the invention includes nucleic acid molecules 
encoding truncations of proteins of the invention, and analogs and homologs 
of proteins of the invention and truncations thereof, as described below. It will 

10 further be appreciated that variant forms of nucleic acid molecules of the 
invention which arise by alternative splicing of an mRNA corresponding to a 
cDNA of the invention are encompassed by the invention. 

Isolated and purified nucleic acid molecules having sequences which 
differ from the nucleic acid sequence of the invention due to degeneracy in 

15 the genetic code are also within the scope of the invention. Such nucleic 
acids encode functionally equivalent proteins but differ in sequence from the 
above mentioned sequences due to degeneracy in the genetic code. 

The invention also includes an isolated nucleic molecule that has a 
mutation as compared to the nucleic acid molecule shown in SEQ ID NO:1 

20 (Figure 6A) or SEQ ID N0:3 (Figure 7A), wherein said mutation is associated 
with Lafora's disease. In a preferred embodiment, the mutation is selected 
from one of the mutations shown in Table 1 or the canine mutation described 
in Example 3. 

Nucleic acid molecules from the EPM2B gene or mutated forms thereof 
25 can be isolated by preparing a labelled nucleic add probe based on all or part 
of the nucleic acid sequences as shown in SEQ ID NO:1 (Figure 6A) and SEQ 
ID NO:3 (Figure 7A) or a mutated sequence shown in Table 1, and using this 
labelled nucleic acid probe to screen an appropriate DNA library (e.g. a cDNA 
or genomic DNA library). Nucleic acids isolated by screening of a cDNA or 
30 genomic DNA library can be sequenced by standard techniques. 

Nucleic acid molecules of the invention can also be isolated by 
selectively amplifying a nucleic acid using the polymerase chain reaction 
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(PGR) methods and cDNA or genomic DNA. It is possible to design synthetic 
oligonucleotide primers from the nucleic acid molecules as shown in SEQ ID 
NO:1 (Figure 6A) and SEQ ID NO:3 (Figure 7A) or a mutated sequence 
shown in Table 1, for use in PGR. A nucleic acid can be amplified from cDNA 
5 or genomic DNA using these oligonucleotide primers and standard PGR 
amplification techniques. The nucleic acid so amplified can be cloned into an 
appropriate vector and characterized by DNA sequence analysis. It will be 
appreciated that cDNA may be prepared from mRNA, by isolating total cellular 
mRNA by a variety of techniques, for example, by using the guanidinium- 

10 thiocyanate extraction procedure of Ghirgwin et al., Biochemistry, 18, 5294- 
5299 (1979). cDNA is then synthesized from the mRNA using reverse 
transcriptase (for example, Moloney MLV reverse transcriptase available 
from Gibco/BRL. Bethesda, MD, or AMV reverse transcriptase available from 
Seikagaku America, Inc., St. Petersburg, FL). 

15 An isolated nucleic acid molecule of the invention which is RNA can be 

isolated by cloning a cDNA encoding a novel protein of the invention into an 
appropriate vector which allows for transcription of the cDNA to produce an 
RNA molecule which encodes the malin protein. For example, a cDNA can 
be cloned downstream of a bacteriophage promoter, (e.g. a T7 promoter) in a 

20 vector, cDNA can be transcribed in vitro with T7 polymerase, and the resultant 
RNA can be isolated by standard techniques. 

A nucleic acid molecule of the invention may also be chemically 
synthesized using standard techniques. Various methods of chemically 
synthesizing polydeoxynucleotides are known, including solid-phase synthesis 

25 which, like peptide synthesis, has been fully automated in commercially 
available DNA synthesizers (See e.g., Itakura et al. U.S. Patent No. 
4,598,049; Gamthers et al. U.S. Patent No. 4.458,066; and Itakura U.S. 
Patent Nos. 4,401,796 and 4,373,071). 

The initiation codon and untranslated sequences of the nucleic acid 

30 molecules of the invention may be determined using currently available 
computer software designed for the purpose, such as PC/Gene 
(IntelliGenetics Inc., Galif.). Regulatory elements can be identified using 
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conventional techniques. The function of the elements can be confirmed by 
using these elements to express a reporter gene which is operatively linked to 
the elements. These constructs may be introduced into cultured cells using 
standard procedures. In addition to identifying regulatory elements in DNA, 
5 such constructs may also be used to identify proteins interacting with the 
elements, using techniques known in the art. 

The sequence of a nucleic acid molecule of the invention may be 
inverted relative to its normal presentation for transcription to produce an 
antisense nucleic acid molecule. Preferably, an antisense sequence is 

10 constructed by inverting a region preceding the initiation codon or an 
unconserved region. In particular, the nucleic acid sequences contained in 
the nucleic acid molecules of the invention or a fragment thereof, preferably a 
nucleic acid sequence shown in SEQ ID N0:1 (Figure 6A) and SEQ ID N0:3 
(Figure 7A) or a mutated sequence shown in Table 1 may be inverted relative 

15 to its normal presentation for transcription to produce antisense nucleic acid 
molecules. 

The antisense nucleic acid molecules of the invention or a fragment 
thereof, may be chemically synthesized using naturally occurring nucleotides 
or variously modified nucleotides designed to increase the biological stability 

20 of the molecules or to increase the physical stability of the duplex formed with 
mRNA or the native gene e.g. phosphorothioate derivatives and acridine 
substituted nucleotides. The antisense sequences may be produced 
biologically using an expression vector introduced into cells in the form of a 
recombinant plasmid, phagemid or attenuated virus in which antisense 

25 sequences are produced under the control of a high efficiency regulatory 
region, the activity of which may be determined by the cell type into which the 
vector is introduced. 

The invention also provides nucleic acids encoding fusion proteins 
comprising a novel protein of the invention and a selected protein, or a 

30 selectable marker protein (see below). 

11. NOVEL PROTEINS OF THE INVENTION 
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The invention further includes an isolated protein encoded by the 
nucleic acid molecules of the invention. Within the context of the present 
invention, a protein of the invention nnay include various structural forms of the 
primary protein which retain biological activity. 
5 Broadly stated, the present invention provides an isolated protein 

containing a zinc finger of the RING type in the N-terminal half and 6 NHL- 
repeat domains in the C-terminal direction which is associated with Lafora's 
disease. Preferably, the zinc-binding RING-finger motif (C-Xa-C-Xie-C-Xi-H- 
X2-C-X2-C-X14-C-X2-C) is located between residues 26-71 of the malin protein 

10 shown in Figure 6B. The presence of a RING finger is predictive of an E3 
ubiquitin ligase function. Therefore, in a preferred embodiment, the protein 
has a ubiquitin ligase function. 

In a. specific embodiment of the invention, the protein has the amino 
acid sequence as shown in SEQ ID NO:2 (Figure 6B) (human EPM2B), In 

15 another embodiment, the protein has the amino acid sequence shown in SEQ 
ID NO:4 (Figure 7B) (canine EPM2B). 

In addition to full length amino acid sequences the proteins of the 
present invention also include truncations of the protein, and analogs, and 
homologs of the protein and truncations thereof as described herein. 

20 Truncated proteins may comprise peptides of at least fifteen amino acid 
residues. 

Analogs of the protein having the amino acid sequence shown in SEQ 
ID N0:2 (Figure 6B) or SEQ ID N0:4 (Figure 78) and/or truncations thereof 
as described herein, may include, but are not limited to an amino acid 

25 sequence containing one or more amino acid substitutions, insertions, and/or 
deletions. Amino acid substitutions may be of a conserved or non-conserved 
nature. Conserved amino acid substitutions involve replacing one or more 
amino acids of the proteins of the invention with amino acids of similar charge, 
size, and/or hydrophobicity characteristics. When only conserved 

30 substitutions are made the resulting analog should be functionally equivalent. 
Non-conserved substitutions involve replacing one or more amino acids of the 
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amino acid sequence with one or more amino acids which possess dissimilar 
charge, size, and/or hydrophobicity characteristics. 

One or more amino acid insertions may be introduced into the amino 
acid sequences shown in SEQ ID N0:2 (Figure 6B) or SEQ ID N0:4 (Figure 

5 7B). Amino acid insertions may consist of single amino acid residues or 
sequential amino acids ranging from 2 to 15 amino acids in length. For 
example, amino acid insertions may be used to destroy target sequences so 
that the protein is no longer active. This procedure may be used in vivo to 
inhibit the activity of a protein of the invention. 

10 Deletions may consist of the removal of one or more amino acids, or 

discrete portions from the amino acid sequence shown in SEQ ID N0:2 
(Figure 6B) or SEQ ID N0:4 (Figure 78). The deleted amino acids may or 
may not be contiguous. The lower limit length of the resulting analog with a 
deletion mutation is about 10 amino acids, preferably 100 amino acids. 

15 Analogs of a protein of the invention may be prepared by introducing 

mutations in the nucleotide sequence encoding the protein. Mutations in 
nucleotide sequences constructed for expression of analogs of a protein of 
the invention must preserve the reading frame of the coding sequences. 
Furthermore, the mutations will preferably not create complementary regions 

20 that could hybridize to produce secondary mRNA structures, such as loops or 
hairpins, which could adversely affect translation of the receptor mRNA. 

Mutations may be . introduced at particular loci by synthesizing 
oligonucleotides containing a mutant sequence, flanked by restriction sites 
enabling ligation to fragments of the native sequence. Following ligation, the 

25 resulting reconstructed sequence encodes an analog having the desired 
amino acid insertion, substitution, or deletion. 

Alternatively, oligonucleotide-directed site specific mutagenesis 
procedures may be employed to provide an altered gene having particular 
codons altered according to the substitution, deletion, or insertion required. 

30 Deletion or truncation of a protein of the invention may also be constructed by 
utilizing convenient restriction endonuclease sites adjacent to the desired 
deletion. Subsequent to restriction, overhangs may be filled in, and the DNA 
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religated. , Exemplary methods of making the alterations set forth above are 
disclosed by Sambrook et al (Molecular Cloning: A Laboratory Manual. 2nd 
. Ed., Cold Spring Harbor Laboratory Press. 1989), 

The proteins of the Invention also include homologs of the amino acid 
5 sequence shown in SEQ ID NO:2 (Figure 6B) or SEQ ID NO:4 (Figure 7B) 
and/or truncations thereof as described herein. Such homologs are proteins 
whose amino acid sequences are comprised of amino acid sequences that 
hybridize under stringent hybridization conditions (see discussion of stringent 
hybridization conditions herein) with a probe used to obtain a protein of the 
10 invention. Preferably, homologs of a protein of the invention will have a 
tyrosine phosphatase region which is characteristic of the protein. 

A homologous protein includes a protein with an amino acid sequence 
having at least 70%, preferably 80-90% identity with the amino acid sequence 
as shown in SEQ ID NO:2 (Figure 6B) or SEQ ID NO:4 (Figure 78). 
15 Sequence identity is as previously defined herein. 

The invention also contemplates isoforms of the proteins of the 
invention. An isoform contains the same number and kinds of amino acids as 
a protein of the invention, but the isoform has a different molecular structure. 
The isoforms contemplated by the present invention are those having the 
20 same properties as a protein of the invention as described herein. 

The invention also includes an isolated protein that has a mutation as 
compared to the amino acid sequence shown in SEQ ID NO:2 (Figure 68) or 
SEQ ID NO:4 (Figure 78). wherein said mutation is associated with Lafora's 
disease. In a preferred embodiment, the mutation is selected from one of the 
25 mutations shown in Table 1 or the canine mutation described in Examj3le 3. 

The present invention also includes a protein of the invention 
conjugated with a selected protein, or a selectable marker protein (see below) 
to produce fusion proteins. Additionally, immunogenic portions of a protein of 
the invention are within the scope of the invention. 
30 The proteins of the invention (including truncations, analogs, mutants 

etc.) may be prepared using recombinant DNA methods. Accordingly, the 
nucleic acid molecules of the present invention having a sequence which 
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encodes a protein of the invention may be incorporated in a known manner 
into an appropriate expression vector which ensures good expression of the 
protein. Possible expression vectors include but are not limited to cosmids, 
plasmids, or modified viruses (e.g. replication defective retroviruses, 
5 adenoviruses and adeno-associated viruses), so long as the vector is 
compatible with the host ceil used. The expression vectors are '^suitable for 
transfonnation of a host cell", means that the expression vectors contain a 
nucleic acid molecule of the invention and regulatory sequences selected on 
the basis of the host cells to be used for expression, which is operatively 
10 linked to the nucleic acid molecule. Operatively linked is intended to mean 
that the nucleic acid is linked to regulatory sequences in a manner which 
allows expression of the nucleic acid. 

The invention therefore contemplates a recombinant expression vector 
of the invention containing a nucleic acid molecule of the invention, or a 
15 fragment thereof, and the necessary regulatory sequences for the 
transcription and translation of the inserted protein-sequence. Suitable 
regulatory sequences may be derived from a variety of sources, including 
bacterial, fungal, or viral genes (For example, see the regulatory sequences 
described in Goeddel, Gene Expression Technology: Methods in Enzymology 
20 185, Academic Press, San Diego, CA (1990). Selection of appropriate 
regulatory sequences is dependent on the host cell chosen, and may be 
readily accomplished by one of ordinary skill in the art. Examples of such 
regulatory sequences include: a transcriptional promoter and enhancer or 
RNA polymerase binding sequence, a ribosomal binding sequence, including 
25 a translation initiation signal. Additionally, depending on the host cell chosen 
and the vector employed, other sequences, such as an origin of replication, 
additional DNA restriction sites, enhancers, and sequences conferring 
inducibility of transcription may be incorporated into the expression vector. It 
will also be appreciated that the necessary regulatory sequences may be 
30 supplied by the native protein and/or its flanking regions. 

The invention further provides a recombinant expression vector 
comprising a DNA nucleic acid molecule of the invention cloned into the 
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expression vector in an antisense orientation. Tliat is, the DNA molecule is 
operatively linked to a regulatory sequence in a manner which allows for 
expression, by transcription of the DNA molecule, of an RNA molecule which 
is antisense to a nucleotide sequence comprising the nucleotides as shown 
5 SEQ ID N0:1 or SEQ ID NO:3. Regulatory sequences operatively linked to 
the antisense nucleic acid can be chosen which direct the continuous 
expression of the antisense RNA molecule. 

The recombinant expression vectors of the invention may also contain 
a selectable marker gene which facilitates the selection of host cells 
10 transformed or transfected with a recombinant molecule of the invention. 
Examples of selectable marker genes are genes encoding a protein such as 
G418 and hygromycin which confer resistance to certain drugs, p- 
galactosidase, chloramphenicol acetyltransferase, or firefly luciferase: 
Transcription of the selectable marker gene is monitored by changes in the 
15 concentration of the selectable marker protein such as p-galactosidase, 
chloramphenicol acetyltransferase. or firefly luciferase. If the selectable 
marker gene encodes a protein conferring antibiotic resistance such as 
. neomycin resistance transformant cells can be selected with G418. Cells that 
have incorporated the selectable marker gene will survive, while the other 
20 cells die. This makes it possible to visualize and assay for expression of 
recombinant expression vectors of the invention and in particular to determine 
the effect of a mutation on expression and phenotype. It will be appreciated 
that selectable markers can be introduced on a separate vector from the 
nucleic acid of interest. 
25 The recombinant expression vectors may also contain genes which 

encode a fusion moiety which provides increased expression of the 
recombinant protein; increased solubility of the recombinant protein; and aid 
in the purification of a target recombinant protein by acting as a ligand in 
affinity purification. For example, a proteolytic cleavage site may be added to 
30 the target recombinant protein to allow separation of the recombinant protein 
from the fusion moiety subsequent to purification of the fusion protein. 
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Recombinant expression vectors can be introduced into host cells to 
produce a transformed host cell. The term "transformed host cell" is intended 
to include prokaryotic and eukaryotic cells which have been transformed or 
transfected with a recombinant expression vector of the invention. The terms 
"transformed with", "transfected with", "transformation" and "transfection" are 
intended to encompass introduction of nucleic acid (e.g. a vector) into a cell 
by one of many possible techniques known in the art. Prokaryotic cells can 
be transformed with nucleic acid by, for example, electroporation or calcium- 
chloride mediated transformation. Nucleic acid can be introduced into 
mammalian cells via conventional techniques such as calcium phosphate or 
calcium chloride co precipitation, DEAE-dextran-mediated transfection, 
lipofectin, electroporation or microinjection. Suitable methods for transforming 
and transfecting host cells can be found in Sambrook et al. (Molecular 
Cloning: A Laboratory Manual. 2nd Edition, Cold Spring Harbor Laboratory 
press (1989)), and other laboratory textbooks. 

Suitable host cells include a wide variety of prokaryotic and eukaryotic 
host cells. For example, the proteins of the invention may be expressed in 
bacterial cells such as E. coli, insect cells (using baculovirus), yeast cells or 
mammalian cells. Other suitable host cells can be found in Goeddel, Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1991). 

The proteins of the invention may also be prepared by chemical 
synthesis using techniques well known in the chemistry of proteins such as 
solid phase synthesis (Merrifield, 1964, J. Am. Chem. Assoc. 85:2149- 
2154) or synthesis in homogenous solution (Houbenweyl, 1987, Methods of 
Organic Chemistry, ed. E. Wansch, Vol. 15 I and II, Thieme. Stuttgart). 
III. APPLICATIONS 

The present invention includes all uses of the nucleic acid molecule 
and proteins of the invention including, but not limited to, the preparation of 
antibodies and antisense oligonucleotides, the preparation of experimental 
systems to study EPM2B and mutated forms thereof, the isolation of 
substances that modulate EPM2B expression and/or activity as well as the 
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use of the EPM2B nucleic acid sequences and proteins and modulators 
thereof in diagnostic and therapeutic applications. Some of the uses are 
further described below. 
A. Diagnostic Applications 

5 As previously mentioned, the present inventors have determined that 

the gene EPM2B is mutated in people and canids with Lafora's disease. As a 
result, the present invention also includes a method of detecting Lafora's 
disease by detecting a mutation in the EPM2B gene or protein. 

The term "mutation" means any change or difference in the nucleic acid 

10 or protein sequence of EPM2B as compared to the wild type sequence. 
Mutations include, but are not limited to, nonsense mutations, missense 
mutations, frameshift mutations, rearrangement mutations, insertion mutations 
and deletion mutations. 

The sample can be any sample containing the EPM2B gene or protein 

15 including, but not limited to, biological fluids (such as blood, urine, 
cerebrospinal fluid, tears, saliva), tissues, tissue extracts, cells and cell 
extracts. 

(1) Detecting Mutations in the Nucleic Acid Sequence 

In one embodiment, the present invention provides a method for 
20 detecting Lafora's disease comprising detecting a mutation in the EPM2B 
gene in a sample obtained from an animal, preferably a mammal, more 
preferably a human or canid. 

Table 1 summarizes some of the mutations found in human EPM2B in 
patients with Lafora's disease. To date, 21 different DNA sequence 
25 alterations have been found in EPM2B in 39 families including 8 deletions and 
1 insertion leading to frame-shifts, 8 missense, and 1 non-sense change. 
Screening assays can be developed for each of the mutations. 

The most common mutation identified in seven families is a 
homozygous 205C->G transition resulting in a proline to alanine change in the 
30 RING-finger domain. Accordingly, in one embodiment, the present invention 
provides a method of detecting Lafora's disease comprising detecting a C 
G mutation at position 205 in the EPM2B gene (SEQ ID NO:1). 
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Another mutation observed in the four consanguineous F-C families 
used in the original linkage study all carried a homozygous 76T^A change 
producing a cysteine-to-serine alteration in one of the 7 conserved cysteine 
residues that are critical for the zinc-binding ability of the RING-finger domain. 

5 Accordingly, In another embodiment, the present invention provides a method 
of detecting Lafora's disease comprising detecting a T ->■ A mutation at 
position 76 in the EPM2B gehe (SEQ ID NO: 1 ). 

Another mutation was observed in the LD38324 family that was critical 
in refining the EPM2B locus (Figure 2a) which was a homozygous 2-bp 

10 deletion (1 048-1 049delGA) leading to a frame-shift mutation in the fifth NHL- 
domain (Figure 2b). Accordingly, in a further embodiment, the present 
invention provides a method of detecting Lafora's disease comprising 
detecting a deletion of GA at positions 1048 and 1049 in the EPM2B gene 
(SEQIDNO-.I). -. 

15 As described in Example 3. "the inventors have also discovered a 

mutation in the EPM2B gene in dogs with Lafora's disease. In particular, all 
affected dogs studied had a bi-allelic expansion of a dodecamer repeat, 
termed D, and having the sequence GCCGCCCCCCGC that starts at 
nucleotide number 1001 of canine EPM2B sequence shown in SEQ ID N0:3. 

20 The inventors have shown that this 12 nucleotide repeat is specific to the 
canid superfamily, which includes dogs, wolves, foxes, coyotes, and jackals, 
and have shown that this repeat predisposes dogs to a massive sequence 
expansion, which is destructive to the EPM2B gene and causes Lafora 
disease. The inventors have thus discovered that canids, including dogs, are 

25 predisposed to Lafora Epilepsy. Accordingly, the invention further provides a 
method of detecting Lafora's disease in a canid comprising detecting a repeat 
of the sequence GCCGCCCCCCGC (SEQ ID NO:5) which starts at 
nucleotide number 1001 in the canine sequence of EPM2B (SEQ ID N0:3). 
In one embodiment, the method involves detecting at least 3 repeats, 

30 preferably at least 10 repeats, more preferably from about 14 to about 26 
repeats in SEQ ID N0:5. 
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One skilled in the art will appreciate that many suitable methods, in 
addition to and including the ones discussed in the examples, can be used to 
detect mutations in the EPM2B gene. Detection methods that can be used 
Include, but are not limited to, polymerase chain reaction (PGR), reverse 
5 transcriptase PGR, direct sequencing electrophoretic mobility, nucleic acid 
hybridization, fluorescent in situ hybridization, denaturing high performance 
liquid chromatography, DNA chip technologies and mass spectroscopy. In 
.. one example, In order to isolate nucleic acids from the Lafora's disease gene 
in a sample, one can prepare nucleotide probes from the nucleic acid 
10 sequences of the invention. In addition, the nucleic acid probes described 
herein can also be used. A nucleotide probe may be labelled with a 
detectable marker such as a radioactive label which provides for an adequate 
signal and has sufficient half life such as ^^P, ^H, ^''G or the like. Other 
detectable markers which may be used include antigens that are recognized 
15 by a specific labelled antibody, fluorescent compounds, enzymes, antibodies 
specific for a labelled antigen, and chemiluminescent compounds. An 
appropriate label may be selected having regard to the rate of hybridization 
and binding of the probe to the nucleotide to be detected and the amount of 
nucleotide available for hybridization. 
20 Accordingly, the present invention also relates to a method of detecting 

the presence of a nucleic acid molecule from the EPM2B gene in a sample 
comprising contacting the sample under hybridization conditions with one or 
more of nucleotide probes which hybridize to the nucleic acid molecules and 
are labelled with a detectable marker, and determining the degree of 
25 hybridization between the nucleic acid molecule in the sample and the 
. nucleotide probes. 

Hybridization conditions which may be used in the methods of the 
invention are known in the art and are described for example in Sambrook J. 
Fritch EF, Maniatis T. In: Molecular Gloning, A Laboratory Manual, 1989. 
30 (Nolan G, Ed.), Gold Spring Harbor Laboratory Press, Gold Spring Harbor, 
NY. The hybridization product may be assayed using techniques known in 
the art. The nucleotide probe may be labelled with a detectable marker as 
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described herein and the hybridization product may be assayed by detecting 
the detectable marker or the detectable change produced by the detectable 
marker. 

Prior to hybridizing a sample with DNA probes, the sample can be 
5 treated with primers that flank the EPM2B gene in order to amplify the nucleic 
acid sequences in the sample. The primers used may be the ones described 
in the present application. In addition, the sequence of the EPM2B gene 
provided herein also permits the identification and isolation, or synthesis of 
new nucleotide sequences which may be used as primers to amplify a nucleic 
10 acid molecule of the invention, for example in the polymerase chain reaction 
(PGR) which is discussed in more detail below. The primers may be used to 
amplify the genomic DNA of other species. The PGR amplified sequences 
can be examined to determine the relationship between the genes of various 
species. 

15 The length and bases of the primers for use in the PGR are selected so 

that they will hybridize to different strands of the desired sequence and at 
relative positions along the sequence such that an extension product 
synthesized from one primer when it is separated from its template can serve 
as a template for extension of the other primer into a nucleic acid of defined 

20 length. Primers which may be used in the invention are oligonucleotides i.e. 
molecules containing two or more deoxyribonucleotides of the nucleic acid 
molecule of the invention which occur naturally as in a purified restriction 
endonuclease digest or are produced synthetically using techniques known in 
the art such as for example phosphotriester and phosphodiester methods 

25 (See Good et al Nucl. Acid Res 4:2157, 1977) or automated techniques (See 
for example, Gonolly, B ,A. Nucleic Acids Res. 15:15(7): 3131, 1987). The 
primers are capable of acting as a point of initiation of synthesis when placed 
under conditions which permit the synthesis of a primer extension product 
which is complementary to the DNA sequence of the invention i.e. in the 

30 presence of nucleotide substrates, an agent for polymerization such as DNA 
polymerase and at suitable temperature and pH. Preferably, the primers are 
sequences that do not form secondary structures by base pairing with other 
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copies of the primer or sequences that form a hair pin configuration. The 
primer preferably contains between about 7 and 25 nucleotides. 

The primers may be labelled with detectable markers which allow for 
detection of the amplified products. Suitable detectable markers are 

5 radioactive markers such as P-32, S-35, 1-125, and H-3, luminescent markers 
such as chemiluminescent markers, preferably luminol, and fluorescent 
markers, preferably dansyl chloride, fluorcein-5-isothiocyanate, and 4-fluor-7- 
nitrobenz-2-axa-1.3 diazole, enzyme markers such as horseradish 
peroxidase, alkaline phosphatase, p-galactosidase. acetylcholinesterase, or 

10 biotin. 

It will be appreciated that the primers may contain non-complementary 
sequences provided that a sufficient amount of the primer contains a 
sequence which is complementary to a nucleic, acid molecule of the invention 
or oligonucleotide fragment thereof, which is to be amplified. Restriction site 

15 linkers may also be incorporated into the primers allowing for digestion of the 
amplified products with the appropriate restriction enzymes facilitating cloning 
and sequencing of the amplified product. 

In an embodiment of the invention a method of determining the 
presence of a nucleic acid molecule of the invention is provided comprising 

20 treating the sample with primers which are capable of amplifying the nucleic 
acid molecule or a predetermined oligonucleotide fragment thereof in a 
polymerase chain reaction to form amplified sequences, under conditions 
which permit the formation of amplified sequences and, assaying for amplified 
sequences. 

25 The polymerase chain reaction refers to a process for amplifying a 

target nucleic acid sequence as generally described in Innis et al, Academic 
Press, 1990 in Mullis el al.. U.S. Pat. No. 4,863,195 and MuHis, U.S. Patent 
No. 4,683,202 which are incorporated herein by reference. Conditions for 
amplifying a nucleic acid template are described in M.A. Innis and D.H. 

30 Gelfand, PGR Protocols, A Guide to Methods and Applications M.A. Innis, 
D.H. Gelfand, J.J. Sninsky and T.J. White eds, pp3-12. Academic Press 1989. 
which is also incorporated herein by reference. 
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The amplified products can be isolated and distinguished based on 
their respective sizes using techniques known in the art. For example, after 
amplification, the DNA sample can be separated on an agarose gel and 
visualized, after staining with ethidium bromide, under ultra violet (UW) light. 
5 DNA may be amplified to a desired level and a further extension reaction may 
be performed to incorporate nucleotide derivatives having detectable markers 
such as radioactive labelled or biotin labelled nucleoside triphosphates. The 
primers may also be labelled with detectable markers as discussed above. 
The detectable markers may be analyzed by restriction and electrophoretic 

id separation or other techniques known in the art. 

The conditions which may be employed in the methods of the invention 
using PGR are those which permit hybridization and amplification reactions to 
proceed in the presence of DNA in a sample and appropriate complementary 
hybridization primers. Conditions suitable for the polymerase chain reaction 

15 are generally known in the art. For example, see M.A. Innis and D.H. Gelfand, 
PGR Protocols, A guide to Methods and Applications M.A. Innis, D M. 
Gelfand, J.J. Sninsky and T.J. White eds, pp3-12, Academic Press 1989. 
which is incorporated herein by reference. Preferably, the PGR utilizes 
polymerase obtained from the thermophilic bacterium Thermus aquatics (Taq 

20 polymerase, GeneAmp Kit, Perkin Elmer Getus) or other thermostable 
polymerase may be used to amplify DNA template strands. 

It will be appreciated that other techniques such as the Ligase Chain 
Reaction (LCR) and NASBA may be used to amplify a nucleic acid molecule 
of the invention (Barney in "PGR Methods and Applications", August 1991, 

25 Vol. 1(1), page 5, and European Published Application No. 0320308, 
published June 14, 1989, and U.S. Serial NO. 5,130,238 to Maiek). 

In some embodiments of the present invention, mutations in the 
sequences of the invention may be detected by the direct sequencing of 
nucleic acid molecules. Techniques for the direct sequencing of DNA are well 

30 known in the art. In one embodiment of the invention, mutations may be 
detected cycle sequencing, which may include the use of a thermostable 
polymerase enzyme, a sequencing primer, dNTPs and limiting amounts of 
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chain terminating fluorescently or radioactively labelled ddNTPs. 
Polyacrylamide gel electrophoresis or another technique such as capillary 
electrophoresis may be used to separate the products of the sequencing 
reactions followed by the detection of the fluorescent or radioactive labels. In 
5 one example of this embodiment of the invention, mutations in EMP2B could 
be determined using automated sequencing on an Applied Biosystems 3700 
DNA Analyzer or 3730x1 DNA Analyzer™. Mutations may be identified by 
comparing the sequence of a patient to that of a wildtype individual or to 
reference sequences found in the public databases. 
10 Mutations in EMP2B may also be detected using the Invader™ 

genotyping system which uses a Cleavase™ Fragment Length Polymorphism 
(CFLP) assay as disclosed in US patent No 5.888,780 and available from 
Third Wave™ Technologies in Madison, Wisconsin. 

Other embodiments of the invention contemplate the use of DNA chip 
15 technologies for the detection of mutations within the EMP2B gene. Among 
other applications, DNA chip technologies allow for the identification of 
mutations within the sequences of the intention through the analysis of the 
hybridisation patterns of a nucleic acid sample onto a high-density spatially 
addressable microan-ay of predetermined sequences. One example of a 
20 DNA chip technology suitable for the identification of mutations is the 
sequences of the invention is the Affymatrix GeneChip™ system, as disclosed 
in U.S. patent No. 6,045,996. This system uses photolithography and solid- 
phase chemistry to produce high density arrays containing hundreds of 
thousands of oligonucleotide probes. Genechips™ may be designed so as to 
25 facilitate the re-sequencing of a particular sequence, allowing for the 
identification of specific mutations. Computer based image analysis of the 
hybridisation patterns enables automatic base calling, and the incorporation of 
quality control measures within the analysis. 

One skilled in the art will be aware that Southern blotting or Northern 
30 blotting may be used to detect pathogenic deletions or rearrangements within 
or near the sequences of the invention. Fluorescence In Situ Hybridization 
(FISH), fiber-FISH or other high-resolution cytogenetic methods may similarly 
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be used for the detection of rearrangements or deletions that disrupt the 
sequences of the invention. 

Another technique for the detection of mutations is denaturing HPLC 
analysis. Accordingly, one embodiment of the invention includes the use of a 

5 Transgenomic Wave™ machine for the dHPLC analysis of nucleic acids for 
the identification of heterozygous mutations or polymorphisms within the 
sequences of the invention. 

The invention also contemplates the use of mass spectroscopy for the 
genotyping of mutations. Mutant and wildtype nucleic acid molecules may 

10 differ in mass due to the different composition of wildtype and mutant 
sequences, allowing for the identification of mutations on the basis of the 
molecular mass of different nucleotide sequences. The use of mass 
spectroscopy, and in particular Matrix Assisted Laser Desorption lonisation 
Time of Flight (MALDI-TOF) mass spectroscopy for the genotyping of 

15 mutations is well known by those skilled in the relevant art. For example. U.S. 
Patent No. 6,043,031 describes a fast and highly accurate mass spectrometer 
based process for detecting a particular nucleic acid sequence. The 
MassARRAY"^ platform from SEQUENOM™ is an example of a 
commercially available system capable of genotyping single nucleotide 

20 polymorphisms and detecting the mutations as described in the present 
invention. 

(ii) Detecting the Malin Protein 

In another embodiment, the present invention provides a method for 
detecting Lafora's disease comprising determining if the malin protein is 
25 present or mutated in a sample from a mammal, preferably a human or dog. 
suspected of having Lafora's disease. 

The malin protein of the present invention may be detected in a 
biological sample using antibodies that are specific for malin using various 
immunoassays that are discussed below. Antibodies that only react with 
30 mutated malin would be useful as diagnostic agents to detect Lafora's 
disease. As such, antibodies would be prepared that bind only a mutated 
region of the protein. 
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Conventional methods can be used to prepare the antibodies. For 
example, by using a peptide from the malin protein of the invention, polyclonal 
antisera or monoclonal antibodies can be made using standard methods. A 
mammal, (e.g., a mouse, hamster, or rabbit) can be immunized with an 
5 immunogenic fomn of the peptide which elicits an antibody response in the 
mammal. Techniques for conferring immunogenicity on a peptide include 
conjugation to carriers or other techniques well known in the art. For 
example, the peptide can be administered in the presence of adjuvant. The 
progress of immunization can be monitored by detection of antibody titers in 
10 plasma or serum. Standard ELISA or other immunoassay procedures can be 
used with the immunogen as antigen to assess the levels of antibodies. 
Following immunization, antisera can be obtained and. if desired, polyclonal 
antibodies isolated from the sera: 

To produce monoclonal antibodies, antibody producing cells 
15 (lymphocytes) can be harvested from an immunized animal and fused with 
myeloma cells by standard somatic cell fusion procedures thus immortalizing 
these cells and yielding hybridoma cells. Such techniques are well known in 
the art, (e.g., the hybridoma technique originally developed by Kohler and 
Milstein (Nature 256, 495-497 (1975)) as well as other techniques such as the 
20 human B-cell hybridoma technique (Kozbor et al., Immunol. Today 4. 72 
(1983)), the EBV hybridoma technique to produce human monoclonal 
antibodies (Cole et al. Monoclonal Antibodies in Cancer Therapy (1985) Allen 
R. Bliss. Inc., pages 77-96). and screening of combinatorial antibody libraries 
(Huse et al., Science 246, 1275 (1989)]. Hybridoma cells can be screened 
25 immunochemically for production of antibodies specifically reactive with the 
peptide and the monoclonal antibodies can be isolated. Therefore, the 
invention also contemplates hybridoma cells secreting monoclonal antibodies 
with specificity for a protein of the invention. 

The term "antibody" as used herein is intended to include fragments 
30 thereof which also specifically react with a protein, of the invention, or peptide 
thereof.' Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above. For 



-29- 



example, F(ab')2 fragments can be generated by treating antibody with 
pepsin. The resulting F(ab')2 fragment can be treated to reduce disulfide 
bridges to produce Fab' fragments. 

Chimeric antibody derivatives, i.e., antibody molecules that combine a 
5 non human animal variable region and a human constant region are also 
contemplated within the scope of the invention. Chimeric antibody molecules 
can include, for example, the antigen binding domain from an antibody of a 
mouse, rat, or other species, with human constant regions. Conventional 
methods may be used to make chimeric antibodies containing the 

10 immunoglobulin variable region which recognizes a malin protein (see, for 
example. Morrison et al., Proc. Natl Acad. Sci. U.S.A. 81,6851 (1985); 
Takeda et al.. Nature 314. 452 (1985). Cabilly et al.. U.S. Patent No. 
4,816.567; Boss et al., U.S. Patent No. 4,816.397; Tanaguchi et al., 
European Patent Publication EP171496; European Patent Publication 

15 0173494, United Kingdom patent GB 2177096B), 

Monoclonal or chimeric antibodies specifically reactive with a protein of 
the invention as described herein can be further humanized by producing 
human constant region chimeras, in which parts of the variable regions, 
particularly the conserved framework regions of the antigen-binding domain, 

20 are of human origin and only the hypervariable regions are of non-human 
origin. Such immunoglobulin molecules may be made by techniques known 
in the art. (e.g., Teng et al., Proc. Natl. Acad. Sci. U.S.A. " 80, 7308-7312 
(1983); Kozbor et al., Immunology Today, 4, 7279 (1983); Olsson et al., Meth. 
Enzymol., 92, 3-16 (1982)), and PCT Publication WO92/06193 or EP 

25 0239400). Humanized antibodies can also be commercially produced 
(Scotgen Limited, 2 Holly Road. Twickenham, Middlesex, Great Britain.) 

Specific antibodies, or antibody fragments, reactive against a protein of 
the invention may also be generated by screening expression libraries 
encoding immunoglobulin genes, or portions thereof, expressed in bacteria 

30 with peptides produced from the nucleic acid molecules of the present 
invention. For example, complete Fab fragments, VH regions and FV regions 
can be expressed in bacteria using phage expression libraries (See for 
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example Ward et al., Nature 341, 544-546: (1989); Huse et al., Science 246, 
1275-1281 (1989); and McCafferty et al. Nature 348, 552-554 (1990)). 

Antibodies may also be prepared using DNA immunization. For 
example, an expression vector containing a nucleic acid of the invention (as 
5 described above) may be injected into a suitable animal such as mouse. The 
protein of the invention will therefore be expressed in vivo and antibodies will 
be induced.. The antibodies can be isolated and prepared as described above 
for protein immunization. 

The antibodies may be labelled with a detectable marker including 
10 various enzymes, fluorescent materials, luminescent materials and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, 
biotin, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; 
examples of suitable fluorescent materials include umbelliferone, fluorescein, 
fluorescein isothiocyanate, rhodamine. dichlorotriazinylamine fluorescein, 
15 dansyl chloride or phycoerythrin; an example of a luminescent material 
includes luminol; and examples of suitable radioactive material include S-35, 
Cu-64, Ga-67, Zr-89, Ru-97. Tc-99m, Rh-105, Pd-109, ln-111, 1-123, 1-125, 
1131, Re-186, Au-198, Au-199, Pb-203. At-211. "Pb-212 and Bi-212. The 
antibodies may also be labelled or conjugated to one partner of a ligand 
20 binding pair. Representative examples include avidin-biotin and riboflavin- 
riboflavin binding protein. Methods for conjugating or labelling the antibodies 
discussed above with this representative labels set forth above may be readily 
accomplished using conventional techniques. 

The antibodies reactive against proteins of the invention (e.g. enzyme 
25 conjugates or labelled derivatives) may be used to detect a protein of the 
invention iii various samples, for example they may be used in any known 
Immunoassays which rely on the binding interaction between an antigenic 
determinant of a protein of the invention and the antibodies. Examples of 
such assays are radioimmunoassays, enzyme immunoassays (e.g. ELISA), 
30 immunofluorescence, immuno-precipitation, latex agglutination, 
hemagglutination, and histochemical tests. Thus, the antibodies may be used 
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to identify or quantify the amount of a protein of tine invention in a sample in 
order to diagnose the presence of Lafora's disease. 

In a method of the invention a predetermined amount of a sample or 
concentrated sample is mixed with antibody or labelled antibody. The amount 

5 of antibody used in the process is dependent upon the labelling agent chosen. 
The resulting protein bound to antibody or labelled antibody may be isolated 
by conventional isolation techniques, for example, salting out, 
chromatography, electrophoresis, gel filtration, fractionation, absorption, 
polyacrylamide gel electrophoresis, agglutination, or combinations thereof. 

0 The sample or antibody may be insolubilized, for example, the sample 

or antibody can be reacted using known methods with a suitable carrier. 
Examples of suitable carriers are Sepharose or agarose beads. When an 
insolubilized sample or antibody is used protein bound to antibody or 
unreacted antibody is isolated by washing. For example, when the sample is 

15 blotted onto a nitrocellulose membrane, the antibody bound to a protein of the 
invention is separated from the unreacted antibody by washing with a buffer, 
for example, phosphate buffered saline (PBS) with bovine serum albumin 
(BSA). 

When labelled antibody is used, the presence of malin can be 
20 determined by measuring the amount of labelled antibody bound to a protein 
of the invention in the sample or of the unreacted labelled antibody. The 
appropriate method of measuring the labelled material is dependent upon the 
labelling agent. 

When unlabelled antibody is used in the method of the invention, the 
25 presence of malin can be determined by measuring the amount of antibody 
bound to the protein using substances that interact specifically with the 
antibody to cause agglutination or precipitation. In particular, labelled 
antibody against an antibody specific for a protein of the invention, can be 
added to the reaction mixture. The presence of a protein of the invention cah 
30 be determined by a suitable method from among the already described 
techniques depending on the type of labelling agent. The antibody against an 
antibody specific for a protein of the invention can be prepared and labelled 



-32 . 

by conventional procedures known in the art which have been described 
herein. The antibody against an antibody specific for a protein of the 
invention may be a species specific anti-immunoglobulin antibody or 
monoclonal antibody, for example, goat anti-rabbit antibody may be used to 
5 detect rabbit antibody specific for a protein of the invention. . 
(iii) Kite 

The reagents suitable for carrying out the methods of the invention may 
be packaged into convenient kits providing the necessary materials, packaged 
into suitable containers. Such kits may include all the reagents required to 

10 detect a nucleic acid molecule or protein of the invention in a sample by 
means of the methods described herein, and optionally suitable supports 
useful in performing the methods of the invention. 

In one embodiment of the invention, the kit includes primers which are 
capable of amplifying a nucleic acid molecule of the invention or a 

15 predetermined oligonucleotide fragment thereof, all the reagents required to 
produce the amplified nucleic acid molecule or predetermined fragment 
thereof in the polymerase chain reaction, and means for assaying the 
amplified sequences. The kit may also include restriction enzymes to digest 
the PGR products, in another embodiment of the invention the kit contains a 

20 nucleotide probe which hybridizes with a nucleic acid molecule of the 
invention, reagents required for hybridization of the nucleotide probe with the 
nucleic acid molecule, and directions for its use. |n a further embodiment of 
the invention the kit includes antibodies of the invention and reagents required 
for binding of the antibody to a protein of the invention in a sample. 

25 The methods and kits of the present invention may be used to detect 

Lafora's disease. Samples which may be tested include bodily materials such 
as blood, urine, serum, tears, saliva, feces, tissues, cells and the like. In 
addition to human samples, samples may be taken from mammals such as 
non-human primates and canids such as dogs. 

3.0 Before testing a sample in accordance with the methods described 

herein, the sample may be concentrated using techniques known in the art, 
such as centrifugation and filtration. For the hybridization and/or PCR-based 
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methods described herein, nucleic acids may be extracted from cell extracts 
of the test sample using techniques known in the art. 
B. £PA72e/Malin Modulators 

In addition to antibodies and antisense oligonucleotides described 
5 above, other substances that modulate EPM2B expression or activity may 
also be identified, as well as substances that modulate mutated fomms of 
malin. 

(i) Substances that Bind Malin 

Substances that affect malin activity can be identified based on their 
10 ability to bind to malin and/or mutated malin. 

Substances which can bind with the malin of the invention may be 

identified by reacting the malin with a substance which potentially binds to 

malin, and assaying for complexes, for free substance, or for non-complexed 

malin, or for activation of malin. In particular, a yeast two hybrid assay system 
15 may be used to identify proteins which interact with malin (Fields, S. and 

Song. O., 1989. Nature. 340:245-247). Systems of analysis which also may 

be used include ELISA. 

Accordingly, the invention provides a method of identifying substances 

which can bind with malin, comprising the steps of: 
20 (a) reacting malin and a test substance, under conditions which 

allow for formation of a complex between the malin and the test substance, 

and 

(b) assaying for complexes of malin and the test substance, for free 
substance or for non complexed malin, wherein the presence of complexes 

25 indicates that the test substance is capable of binding malin. 

The malin protein used in the assay may have the amino acid 
sequence shown in SEQ ID NO:2 (Figure 68) or SEQ ID NO:4 (Figure 7B) or 
may be a mutated protein associated with LD as described herein or may be a 
fragment, analog, derivative, homolog or mimetic thereof as described herein. 

30 Conditions which permit the formation of substance and malin 

complexes may be selected having regard to factors such as the nature and 
amounts of the substance and the protein. 
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The substance-protein complex, free substance or non-complexed 
proteins may be isolated by conventional isolation techniques, for example, 
salting out, chromatography, electrophoresis, gel filtration, fractionation, 
absorption, polyacrylaniide gel electrophoresis, agglutination, or combinations 
5 thereof. To facilitate the assay of the components, antibody against malin or 
the substance, or labelled malin, or a labelled substance may be utilized. The 
antibodies, proteins, or substances may be labelled with a detectable 
substance as described above. 

Malin. or the substance used in the method of the invention may be 
10 insolubilized. For example, malin or substance may be bound to a suitable 
carrier. Examples of suitable carriers are agarose, cellulose, dextran, 
Sephadex, Sepharose. carboxymethyl cellulose polystyrene, filter paper, ion- 
exchange resin, plastic film, plastic tube, glass beads, polyamine-methyl vinyl- 
ether-maleic acid copolymer, amino acid copolymer, ethylene-maleic acid 
15 copolymer, nylon, silk. etc. The carrier may be in the shape of, for example, a 
tube, test plate, beads, disc, sphere etc. 

The insolubilized protein or substance may be prepared by reacting the 
material with a suitable insoluble carrier using known chemical or physical 
. methods, for example, cyanogen bromide coupling. 
20 The proteins or substance may also be expressed on the surface of a 

cell using the methods described herein. 

The invention also contemplates assaying for an antagonist or agonist 
of the action of malin. 

It will be understood that the agonists and antagonists that can be 
25 assayed using the methods of the invention may act on one or more of the 
binding sites on the protein or substance including agonist binding sites, 
competitive antagonist binding sites, non-competitive antagonist binding sites 
or allosteric sites. 

The invention also makes it possible to screen for antagonists that 
30 inhibit the effects of an agonist of malin. Thus, the invention may be used to 
assay for a substance that competes for the same binding site of malin. 
(ii) Peptide Mimetics 
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The present invention also includes peptide nnimetics of the malin and 
mutated malin proteins of the invention. For example, a peptide derived from 
a the mutated domain of malin will interact directly or indirectly with an 
associated molecule in such a way as to mimic the native binding of the 
5 mutated protein. Such peptides may include competitive inhibitors, 
enhancers, peptide mimetics. and the like. All of these peptides as well as 
molecules substantially homologous, complementary or otherwise functionally 
or structurally equivalent to these peptides may be used for purposes of the 
present invention. 

10 "Peptide mimetics" are structures which serve as substitutes for 

peptides in interactions between molecules (See Morgan et al (1989), Ann. 
Reports Med. Chem. 24:243-252 for a review). Peptide mimetics include 
synthetic structures which may or may not contain amino acids and/or peptide 
bonds but retain the structural and functional features of a peptide, or 

15 enhancer or inhibitor of the invention. Peptide mimetics also include peptoids, 
oligopeptoids (Simon et al (1972) Proc. Natl. Acad. Sci USA 89:9367); and 
peptide libraries containing peptides of a designed length representing all 
possible sequences of amino acids corresponding to a peptide of the 
invention. 

20 Peptide mimetics may be designed based on information obtained by 

systematic replacement of L-amino acids by D-amino acids, replacement of 
side chains with groups having different electronic properties, and by 
systematic replacement of peptide bonds with amide bond replacements. 
Local conformational constraints can also be introduced to determine 

25 conformational requirements for activity of a candidate peptide mimetic. The 
mimetics may include isosteric amide bonds, or D-amino acids to stabilize or 
promote reverse turn conformations and to help stabilize the molecule. Cyclic 
amino acid analogues may be used to constrain amino acid residues to 
particular conformational states. The mimetics can also include mimics of 

30 inhibitor peptide secondary structures. These structures can model the 3- 
dimensional orientation of amino acid residues into the known secondary 
conformations of proteins. Peptoids may also be used which are oligomers of 
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N-substituted amino acids and can be used as motifs for tlie generation of 
chemically diverse libraries of novel molecules. 

Peptides of the invention may also be used to identify lead compounds 
for drug development. The structure of the peptides described herein can be 
5 readily determined by a number of methods such as NMR and X-ray 
crystallography. A comparison of the structures of peptides similar in 
sequence, but differing In the biological activities they elicit in target molecules 
can provide information about the structure-activity relationship of the target. 
Information obtained from the examination of structure-activity relationships 
1Q can be used to design either modified peptides, or other small molecules or 
lead compounds that can be tested for predicted properties as related to the 
target molecule. The activity of the lead compounds can be evaluated using 
assays similar to those described herein. 

Information about structure-activity relationships may also be obtained 
15 from co-crystallization studies. In these studies, a peptide with a desired 
activity is crystallized in association with a target molecule, and the X-ray 
structure of the complex is determined. The structure can then be compared 
to the structure of the target molecule in its native state, and information from 
such a comparison may be used to design compounds expected to possess. 
20 (ill) Drue Screenino Methods 

In accordance with one embodiment, the invention enables a method 
for screening candidate compounds for their ability to increase or decrease 
the activity of the mutated malin protein. The method comprises providing an 
assay system for assaying malin activity, assaying the activity in the presence 
25 or absence of the candidate or test compound and determining whether the 
compound has increased or decreased malin activity. Such compounds may 
be useful in treating Lafora's disease. 

Accordingly, the present invention provides a method for identifying a 
compound that affects mutated malin protein activity or expression 
30 comprising: 

(a) incubating a test compound with a malin protein or a nucleic acid 
encoding a malin protein; and 
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(b) determining an amount of malin protein activity or expression and 
comparing with a control (i.e. in the absence of the test substance), wherein a 
change in the malin protein activity or expression as compared to the control 
indicates that the test compound has an effect on malin protein activity or 
5 expression. 

In accordance with a further embodiment, the invention enables a 
method for screening candidate compounds for their ability to increase or 
decrease expression of a malin protein. The method comprises putting a cell 
with a candidate compound, wherein the cell includes a regulatory region of a 

10 malin gene operably joined to a reporter gene coding region, and detecting a 
change in expression of the reporter gene. 

In one embodiment, the present invention enables culture systems in 
which cell lines which express the mutated malin gene are incubated with 
candidate compounds to test their effects on mutated malin expression. Such 

15 culture systems can be used to identify compounds which upregulate or 
downregulate malin expression or its function, through the interaction with 
other proteins. 

Such compounds can be selected from protein compounds, chemicals 
and various drugs that are added to the culture medium. After a period of 

20 incubation in the presence of a selected test compound(s), the expression of 
mutated malin can be examined by quantifying the levels of malin mRNA 
using standard Northem blotting procedure, as described in the examples 
included herein, to detemiine .any changes in expression as a result of the test 
compound. Cell lines transfected with constructs expressing malin can also 

25 be used to test the function of compounds developed to modify the protein 
expression. 
C. Therapeutic Uses 

As previously discussed, the EPM2B gene and malin of the invention is 
likely involved in Lafora's disease. Accordingly, the present invention 

30 provides a method of treating Lafora's disease comprising of administering to 
a cell or animal in need thereof, an effective amount of agent that modulates 
£PM2S/malin expression and/or activity. The present invention also provides 
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a use of an agent that modulates £PM2S/malin expression and/or activity to 
treat Lafora's disease or to prepare a medicament to treat Lafora's disease. 

The term "agent that modulates £PM28/malin expression and/or 
activity" means any substance that can alter the expression and/or activity of 

5 the mutated EPM2fi/malin found in the animal to be consistent v»;ith the wild 
type EPM2B/malin. Examples of agents which may be used to include 
administering: a nucleic acid molecule encoding wild type EPM2B; the wild 
type malin protein as well as fragments, analogs, derivatives or homologs 
thereof; antibodies; antisense nucleic acids; peptide mimetics; and 

10 substances isolated using the screening methods described herein that can 
correct the mutation to result in £PM2e/malin levels and/or function consistent 
with a person without the disease. 

The term "effective amounf as used herein means an amount 
effective, at dosages and for periods of time necessary to achieve the desired 

15 results. 

The term "animal" as used herein includes all members of the animal 
kingdom, including humans and dogs. 

In one embodiment, the invention provides a method of treating 
Lafora's disease by administering to a cell or animal an effective amount of an 
20 agent that modulates the expression or the biological activity of the mutated 
malin protein. The present invention also provides a use of an effective 
amount of an agent that modulates the expression or the biological activity of 
the mutated malin protein to treat Lafora's disease or to prepare a 
medicament to treat Lafora's disease. Substances that inhibit the activity of 
25 mutated malin include peptide mimetics, malin antagonists and certain 
antibodies to malin. Substances that inhibit the expression of the mutated 
EPM2B gene include antisense oligonucleotides to a mutated EPM2B nucleic 
acid sequence. 

In accordance with another embodiment, the present invention enables 
30 gene therapy as a potential therapeutic approach to Lafora's disease, in which 
normal copies of the EPM2B gene are introduced into patients to successfully 
code for normal malin protein in several different affected cell types. 
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Retroviral vectors can be used for somatic cell gene therapy especially 
because of their high efficiency of infection and stable integration and 
expression. The targeted cells however must be able to divide and the 
expression of the levels of normal protein should be high. The full length 
5 normal EPM2B gene can be cloned into a retroviral vector and driven from its 
endogenous promoter or from the retroviral long terminal repeat or from a 
promoter specific for the target cell type of interest. Other viral vectors which 
can be used include adeno-associated virus, vaccinia virus, bovine papilloma 
virus, or a herpesvirus such as Epstein-Barr virus. Gene transfer could also 
10 be achieved using non-viral means requiring infection in vitro. This would 
include calcium phosphate, DEAE dextran, electroporation. cationic or anionic 
lipid fomiulations (liposomes) and protoplast fusion. Although these methods 
are available, many of these are lower efficiency. 

Anti-sense based strategies can be employed to inhibit mutated 
15 EPM2B gene function and as a basis for therapeutic drug design. The 
principle is based on the hypothesis that sequence specific suppression of 
gene expression can be achieved by intracellular hybridization between 
mRNA and a complementary anti-sense species. It is possible to synthesize 
anti-sense strand nucleotides that bind the sense strand of RNA or DNA with 
20 a high degree of specificity. The formation of a hybrid RNA duplex may 
interfere with the processing/transport/translation and/or stability of a target 
mRNA. 

Hybridization is required for an antisense effect to occur. Antisense 
effects have been described using a variety of approaches including the use 

25 of antisense oligonucleotides, injection of antisense RNA, DNA and 
transfection of antisense RNA expression vectors. 

Therapeutic antisense nucleotides can be made as oligonucleotides or 
expressed nucleotides. Oligonucleotides are short single strands of DNA 
which are usually 15 to 20 nucleic acid bases long. Expressed nucleotides 

30 are made by an expression vector such as an adenoviral, retroviral or plasmid 
vector. The vector is administered to the cells in culture, or to a patient, 
whose cells then make the antisense nucleotide. Expression vectors can be 
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designed to produce antisense RNA, which can vary in length from a few 

dozen bases to several thousand. 

Antisense effects can be induced by control (sense) sequences. The 

extent of phenotypic changes are highly variable. Phenotypic effects induced 
5 by antisense are based on changes in criteria such as biological endpoints, 

protein levels, protein activation measurement and target mRNA levels. 

D. Pharmaceutical Compositions 

The above described substances including nucleic acids encoding 

EPM2B and mutated EPM2B, malin and mutated malin proteins, antibodies, 
10 and antisense oligonucleotides as well as other agents that modulate 

.EP/W2S/malin and/or mutated £PM2S/malin may be formulated into 

pharmaceutical compositions for administration to subjects in a biologically 

compatible form suitable for administration in vivo. By "biologically compatible 

form suitable for administration in vivo" is meant a form of the substance to be 
15 administered in which any toxic effects are outweighed by the therapeutic 

effects. The substances may be administered to living organisms including 

humans, and animals. 

Administration of a therapeutically active amount of pharmaceutical 

compositions of the present invention is defined as an amount effective, at 
20 dosages and for periods of time necessary to achieve the desired result. For 

example, a therapeutically active amount of a substance may vary according 

to factors such as the disease state, age. sex, and weight of the individual, 

and the ability of the substance to elicit a desired response in the individual. 

Dosage regimes may be adjusted to provide the optimum therapeutic 
25 response. For example, several divided doses may be administered daily or 

the dose may be proportionally reduced as indicated by the exigencies of the 

therapeutic situation. 

An active substance may be administered in a convenient manner such 
as by injection (subcutaneous, intravenous, etc.), oral administration, 
30 inhalation, transdermal application, or rectal administration. Depending on the 
route of administration, the active substance may be coated in a material to 
protect the compound from the action of enzymes, acids and other natural 
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conditions which may inactivate the compound. If the active substance is a 
nucleic acid encoding, for example, a modified EPM2B gene may be delivered 
. using techniques known in the art. 

The compositions described herein can be prepared by per se known 

5 methods for the preparation of pharmaceutically acceptable compositions 
which can be administered to subjects, such that an effective quantity of the 
active substance is combined in a mixture with a pharmaceutically acceptable 
vehicle. Suitable vehicles are described, for example, in Remington*s 
Pharmaceutical Sciences (Remington's Pharmaceutical Sciences. Mack 

10 Publishing Company, Easton, Pa., USA 1985) or Handbook of 
Pharmaceutical Additives (compiled by Michael and Irene Ash, Gower 
Publishing Limited. Aldershot. England (1995)). On this basis, the 
compositions include, albeit not exclusively, solutions of the substances in 
association with one or more pharmaceutically acceptable vehicles or 

15 diluents, and may be contained in buffered solutions with a suitable pH and/or 
be iso-osmotic with physiological fluids. In this regard, reference can be 
made to U.S. Patent No. 5,843,456. As will also be appreciated by those 
skilled, administration of substances described herein may be by an inactive 
viral carrier. 

20 E. Experimental Models 

The present invention also includes methods and experimental models 
for studying the function of the EPM2B gene and malin protein. Cells, tissues 
and non-human animals that lack the EPM2B gene or partially lack in malin 
expression may be developed using recombinant expression vectors having a 

25 specific deletion or mutation in the EPM2B gene. A recombinant expression 
vector may be used to inactivate or alter the EPM2B gene by homologous 
recombination and thereby create an EPM2B deficient cell, tissue or animal. 

Null alleles may be generated in cells, such as embryonic stem cells by 
deletion mutation. A recombinant EPM2B gene may also be engineered to 

30 contain an insertion mutation which inactivates EPM2B. Such a construct 
may then be introduced into a cell, such as an embryonic stem cell, by a 
technique such as transfection. electroporation, injection etc. Cells lacking an 
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intact EPM2B gene may then be identified, for example by Southern blotting, 
Northern Blotting or by assaying for EPM2B using the methods described 
herein. Such cells may then be fused to embryonic stem cells to generate 
transgenic non-human animals deficient in EPM2B. Germline transmission of 

.5 the mutation may be achieved, for example, by aggregating the embryonic 
stem cells with early stage embryos, such as 8 cell embryos, in vitro; 
transferring the resulting blastocysts into recipient females and; generating 
germline transmission of the resulting aggregation chimeras. Such a mutant 
animal may be used to define specific cell populations, developmental 

10 patterns and in vivo processes, nomnally dependent on EPM2B expression. 
The present invention also includes the preparation of tissue specific knock- 
. . outs of the EPI\/I2B gene. 

The following non-limiting example is illustrative of the present 
invention: 

15 EXAMPLES 
Example 1 

The inventors examined all available databases and found 7 annotated 
genes in the newly defined critical region on chromosome 6p22.3 (Figure 1). 
Based on their predicted functional characteristics each gene was prioritized 

20 for mutation screening through DNA sequencing of LD patients canning an 
EPM2B haplotype, but no pathogenic variants were identified. 
Simultaneously, the inventors' analysis led to the discovery of a previously 
uncharacterized apparently single-exon (1188 bp) gene sharing extensive 
sequence identity with orthologous units (with equivalent protein-coding 

25 potential) in other higher vertebrates (75%, 78%, 87% nucleotide identity with 
rat. mouse, and dog, respectively). The human gene, designated as EPM2B, 
also had at its 5'-end all of the proposed features of the consensus sequence 
of an eukaryotic translational initiation site and, at its 3'-end, two putative 
polyadenylation signals (Figure 1). Moreover, expressed sequence tag (EST). 

30 and cDNA data in human and mouse supported that the single-exon unit was 
a bona fide gene. 
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EPM2B would be predicted to encode a 395 amino acid (aa) protein 
that the inventors have named malin {mal for seizure in French) containing a 
zinc finger of the RING type in the N-terminai half and 6 NHL-repeat donnains 
in the C-terminal direction (Figure 1). Specifically, the zinc-binding RING- 
5 finger motif (C-X2-C-X16-C-X1-H-X2-C-X2-C-X14-C-X2-C) was identified (E- 
value of 0.0067) between residues 26-71 of malin consistent with the 
signature sequence (C-X2-C-X9-39-C-Xi.3-H-X2.3-H-X2-C-X4^8-C-X2-C) of the 
RiNG-HC type (16,17). The presence of a RING finger is predictive of an E3 
ubiquitin ligase function (8,9,18). E3 ligation is the final and specific step of 
10 the ubiquitin pathway transferring ubiquitin from E2, either directly or through 
adaptor proteins, to a specific substrate(s) to initiate its removal by the 
proteasome system (8). The 6 NHL domains (10-13) were predicted on the 
basis of presence of an approximately 44-residue motif rich in glycine and 
hydrophobic amino acids seeded with a cluster of charged residues (Pfam 
15 detected six trusted matches for NHL domains with E-values ranging from 
0.011 to 3.5) (Figure 1). 

Northern-blot analysis indicated EPM2B is present (as at least two 
transcripts 1.5kb and 2.4kb in size) in all tissues examined including specific 
sub-regions of the brain (Figure 3). The observed transcript sizes correspond 
20 near to the lengths expected between the predicted ATG-start site and the 
two different polyadenylation signals (Figure 1). Moreover, the expression 
profile was similar to that observed for EPM2A, both being present in all 
tissues in which Lafora bodies have been obsen/ed (2,19). 

The complete coding region of EPM2B was sequenced in a cohort of 
25 34 LD probands previously shown not to carry mutations in EPM2A. In this 
Example, 17 different DNA sequence alterations are described in EPM2B in 
26 families including 8 deletions and 1 insertion leading to frame-shifts, 7 
missense, and 1 non-sense change (Table 1). These mutations were found in 
families in both homozygous (18) and compound heterozygous (8) recessive 
30 states. The four consanguineous F-C families used in the original linkage 
study all carried a homozygous 76T->'A change producing a cysteine-to- 
serine alteration in one of the 7 conserved cysteine residues that are critical 
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for the zinc-binding ability of the RING-finger domain (Figure 2b). The most 
common mutation identified (7 families) was a homozygous 205C-*G 
transition resulting In a proline to alanine change in the RING-finger domain. 
The LD38324 family that was critical In refining the EPM2B locus (Figure 2a) 

5 carried a homozygous 2-bp deletion (1 048-1 049delGA) leading to a frame- 
shift mutation in the fifth NHL-domaIn (Figure 2b). Referring to Table 1, In 
family LD51. DNA from the proband was not available but the parents were 
both heterozygous earners of 468-469delAG, which would be predicted to 
lead to homozygous frame-shift mutations in the LD child. All of the mutations 

10 detected would affect the putative RING or NHL motifs, or would be predicted 
to lead to a frame-shift or cause drastic structural change In the protein 
(LD483 carries a 260T-^C nucleotide change which would lead to a leucine to 
proline alteration). Four silent DNA sequence-coding variants were identified. 
Three of them T312C (H104H). G372C (G124G) and T1020C (G34QG) were 

15 present In five, two, and one of 100 control chromosomes, respectively. The 
most common polymorphism detected, C332T (P111L) (Figure 1) was 
observed on 42 of 100 control chromosomes. 

In total, 88% or the LD families can now be accounted for by mutations 
In EPM2A (48%) and EPM2B (40%). The observation of 8 families with no 

20 detectable mutations in either of these genes suggests there could be 
additional LD loci. 

Among other conditions with polyglucosan accumulation, including 
adult polyglucosan body disease (APBD) (20). LD is unique for the sub- 
cellular location of inclusions in neuronal dendrites but not axons (21), as is 

25 show in Figure 4. The physical association of the forming polyglucosan fibrils 
with ER Is also specific to LD (3). In APBD, which is caused by mutations in 
the glycogen branching enzyme (20,22,23), polyglucosans are 
indistinguishable in size, composition, and number from Lafora bodies, but 
they are located exclusively In the cell soma and axons (20). The presence of 

30 a seizure phenotype in LD but not in APBD implicates ER-associated dendritic 
accumulations of polyglucosans in the epilepsy of LD. 
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To examine the cellular localization of malin the inventors transfected 
an epitope-tagged EPM2B construct and found that it did indeed localize at 
the ER and to a lesser extent within the nucleus of cultured cells (Figure 5). 
These results were similar to the cellular localization observed for the two 
5 alternative transcripts (A and B) of EPM2A, which encode isoforms of laforin 
found in the cytoplasm at the ER (24-26) (Figure 4b) and in the nucleus, 
respectively (27). The inventors most recent data implicate loss of function of 
the cytoplasmic form of laforin in LD based on the identification of transcript A- 
specific mutations (L.I. et al., manuscript submitted). Moreover, the study of 
10 murine-£PM2/\ knockouts (28) and LD patients (1,2) has shown that the ER- 
associated polyglucosan bodies precede or are concomitant, respectively, 
with onset of epilepsy. 

Therefore, in the simplest explanation, LD arises due to improper 
clearance, and subsequent accumulation of polyglucosans in dendrites. 
15 disturbing neuronal synaptic function leading to epileptogenesis. The 
inventors have now shown in transgenic LD mice that laforin contacts 
polyglucosans (and not glycogen) providing the first physical link between a 
disease gene product and LD pathology (E.M.C. et al.. in preparation). 
Laforin's only other experimentally-validated function is that of a dual- 
20 specificity phosphatase (24,25), which would predict that there is at least one 
phosphoprotein intermediary through which it acts. Possible candidates could 
be the newly discovered EPM2AIP1 laforin-interacting protein (26), or other 
still to be cloned LD gene(s), or any of their interacting proteins. Malin will 
likely be involved via specific protein-protein interaction through its NHL 
25 domains followed by . ubiquitin-mediated removal of a regulatory target(s), 
contributing a crucial role with laforin to safeguard neurons against Lafora 
bodies and epilepsy. 
METHODS 
Samples 

30 All patients described in this study were formally diagnosed with 

adolescent-onset progressive myoclonus epilepsy based on presence of 
pathognomonic Lafora bodies in biopsies of skin, skeletal muscle, liver or 
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brain. Each LD individual and their family members (if available) were 
examined for involvement of the EPM2A locus at 6q24 by homozygosity 
mapping, mutation screening, or both. 
Genotyping and Mutation Screening 

5 Infomiation and amplification conditions on the established {D6S274, 

D6S2B5, D6S966, D6S1567. D6S1678, D6S1688, D6S1959) and new 
{BV012563, BV012730. BV012565, BV012566, BV012568) are found in the 
UniSTS and Entrez Nucleotides database (http://www.ncbi.nim.nih.gov/). DNA 
Sequence variations were detected by sequencing of PCR-products. To 
10 screen EPM2B, two sets of primer pairs that amplify overlapping fragments 
were used EPM2B-: F: (5'-ACTGTGACCGTG ACCGAGA-3') and EPM2BA R: 
(CACACCCCAAGGTAAGGAGA-3'); EPM2B-2?: (5'- 

GACTGCCATGTGGTTGTCAC-3') and EPM2B-2R: (5'- 

AAACAATTCATTAATGGCAGCA-3') (see Figure 1). PGR was performed on 
15 50 ng of DNA in buffer [75 mM Tris-HCI (pH 8.8). 20 mM (NH4)2S04, 0.01% 
Tween 20, 1.5 mM MgC12, 1M Betaine, 0.2 mM dNTP. 0.2 uM of each primer, 
2.5 Units of Taq Polymerase (MBI Fermentas)]. Cycling conditions were: initial 
denaturation at 94X for 3 min followed by 35 cycles of denaturation at 94°C 
for 30 sec, annealing at 60°C for 30 sec and extension at 72°C for 30 sec, 
20 with a 10 min final extension at 72°C. PGR products were purified using 
mircroCLEAN (Microzone Ltd). 3mI (100ng/|jl) of purified PGR product was 
used as sequencing template. For all reactions, 1|j1 (5 pmol) of primer, 1.5pl 
5X sequencing buffer (Applied Biosystems), Ipl BigDye Terminator v3.1, and 
7.5jj| H20 in a 14^1 reaction volume were used. Thermocycling (MJ 
25 Research, Inc.) conditions were denaturing at 96°C for 30s; annealing at 50° 
for 20s; and extension at 60°G for 4 min; 35 cycles. All reactions were 
subsequently purified using multiscreen-HV filter plates (Millipore) and 
analyzed using an ABI-3700. All sequence variants detected in LD patients 
were examined in a collection of 50 (100 chromosomes) randomly selected 
30 DNA samples. 

Gene identification and Northern blots 
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Gene annotation data and EST sequences were mainly obtained from 
the University of California Santa Cruz (UCSC) genome bro>A«er 
(http://genome.ucsc.edu/) and Celera Genomics (http://www.celera.com/). 
Additional putative genes were annotated using the Genescript algorithm 

5 (http://tcag.bioinfo.sickkids.on.ca/genescript/) and multi-species VISTA 
(http://www-gsd.lbl.gov/vista/) alignments between human and mouse 
sequence. The RING finger domain was predicted by Pfam. Prosite, 
InterProScan, SMART and MotifScah. NHL domains were identified using 
Pfam and InterProScan. EPM2B orthologues were identified using BLASTN 

10 and BLASTP analyses against the GenBank non-redundant database. 
Prediction of the sub-cellular localization of malin by sequence analysis was 
performed using PSORT II. No significant signal peptide sequence (for 
recognition of ER. Golgi complex, lysosome and integral plasma membrane 
. proteins), mitochondrial targeting sequence, nuclear localization signal and 

15 peroxisomal targeting signal was identified. The human multiple-tissue blot I 
and human brain blot II (Clontech) were probed with a p^PidCTP-labeled 
probe that was generated using the primers 5'- 
GTCACCATCACCAACGACTG-3' and 5'-TGCGAAAGACCATGAGTGAC-3', 
which amplified a 557bp fragment within the coding region of EPM2B. 

20 Hybridization and washing conditions were perfomied according to the 
manufacturer's instructions. 

Sub-cellular localization and Electron Microscopy 

The myc-tagged EPM2A transcript A expression construct 
(pcDNA3mycEPM2A), which encodes the cytoplasmic isoform of laforin has 

25 been described (24). A myc-tagged EPM2B construct was generated 
(pcDNA3myc£PiW2e) using the same general protocols. Full-length EPM2B 
was amplified by PGR from genomic DNA using the (fonward) primer (5'- 
ggatccATGgcggccgaagc-3') containing a BamH\ restriction site (underlined) 
and the start codon (uppercase) and a (reverse) primer (5'- 

30 gcgaccgcacaattcattaatggcagac-3') containing a Notl site (underlined). This 
product was cloned into the corresponding sites of the mammaHan expression 
vector pcDNA3 (Invitrogen). Myc was then introduced, in frame, after 
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amplifying from a previous myc-containing vector with 5' KpnI-tagged and 3' 
SamHI-tagged primers. pcDNA3mycEPM2/\ and pcDNA3mycEP/W26 (2 mg) 
were transfected into Cos-7 cells using Lipofectamine-Plus (Invitrogen) and 
exposed to lipid-DNA complex in DMEM (Sigma-Aldrich) for 5 hours. Forty- 
eight hours post-transfection, cultures were rinsed twice in PBS and fixed for 
15 min at -20''C in an acetone.methanol (1:1) mix. They were then stained 
with antibodies against myc-laforin and ER marl<er GRP94. Cultures were 
blocked for 1 hour (10% BSA/PBS) and incubated with anti-Myc and anti-ER 
for 45 min at room terriperature. Slides were washed with PBS and incubated 
with secondary antibody (FITC-labeled goat anti-mouse, 1:400, detectable 
through the green filter; Texas red-labeled donkey anti-goat, 1:400, detectable 
through the red filter; Jackson ImmunoResearch Laboratories) in blocking 
solution. Following mounting (Dako Anti-Fade), they were analyzed by 
immunofluorescence light microscopy. For electron microscopic examination 
biopsy material was obtained from the LD patient and placed into chilled 
Universal fixative. Using standard protocols it was then analyzed at the ultra- 
structural level. 
Example 2 

In addition to the mutations described in Example 1, the inventors have 
found an additional 4 mutations in EPM2B that is associated with Lafora's 
disease. These mutations were described in the same fashion as Example 1. 
The DNA was obtained from blood from patients with Lafora disease clinically 
similar to the ones described in Example 1. The three mutations in EPM2B 
are as follows: 1) A deletion of nucleotide T at position 606; 2) An A to T 
change at nucleotide 923; 3) A G to T change at nucleotide 580; and 4) A G to 
T change at nucleotide 199. The latter mutation was found in both a 
Sudanese and Italian family. The additional mutations are listed at the bottom 
of Table! 
Example 3 

Lafora's disease has also been discovered in dogs with surprising 
frequency. Progressive myoclonus epilepsy (PME), is common (~5%) in the 
popular Miniature Wirehalred Dachshund (MWHD) breed in the United 
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Kingdom. The inventors characterized the clinicopathologic phenotype of 
these dogs and show them to have LD. The inventors mapped the underlying 
disease locus and identified the specific disease-associated mutation (in the 
EPM2B gene): the first coding dodecamer repeat expansion in any species 
5 and the first disease-causing tandem repeat expansion outside human. The 
inventors have shown that the expansion mutation is recurrent, affecting 
epileptic dogs other than MWHD and demonstrate that it arises from a 
sequence variation particular to dogs (and other canids), predisposing the 
species to LD. Finally, the inventors devised a test to detect expanded alleles 

10 in carrier and presymptomatic animals and allow eradication of the disease 
' from MWHD and future affected canine populations. 

Fourteen affected MWHD with age of onset between six to nine years 
were initially evaluated. Myoclonus consisted of symmetrical split-second 
contractions of neck and limb muscles causing retropulsion of the dog; jaws 

15 would chatter or snap shut, and eyelids would blink rapidly. Myoclonus 
. occurred spontaneously but could be triggered by sudden noises, or visual 
stimuli such as a ball rolling towards the dog or the strobe of light through 
trees during a car ride. Myoclonus was less likely when the dog was focused 
such as during chasing after rabbits and more likely with excitement or 

20 nervousness such as at feeding time or with visitors. Atonic attacks causing 
the animal to suddenly sit or fall and generalized seizures with rigidity, 
paddling, vocalization, foaming at the mouth, urination and unconsciousness 
were common. Older dogs were ataxic and blind, their seizures more 
frequent and unresponsive to medications. Eventually (ages 9 to 12), 

25 myoclonus and drop attacks were constant, compelling euthanasia. This 
clinical picture precisely recapitulates human LD (2), as does the underlying 
pathology (Figure 8). 

Having confirmed that the dogs have LD, the inventors first cloned the 
canine EPM2A gene, and excluded its involvement. The inventors next 

30 undertook a genome-wide scan to localize the disease locus. The inventors 
genotyped 241 canine-specific microsatellite markers (31) spanning the entire 
canine genome in the 14 affected dogs and four unaffected relatives. The two 
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markers at which the largest number of affected dogs were homozygous and 
the largest number of unaffected dogs heterozygous were REN94K23 and 
REN01G01, which map to the small 38Mb chromosome 35 (CFA35). The 
inventors ascertained more dogs from the extended pedigree shown in Figure 
5 9a, and genotyped additional markers on CFA35. A maximum two-point LOD 
score of 2.65 at a recombination fraction 6=0.00 was obtained for marker 
REN157J09. Multipoint linkage analysis generated a maximum LOD score of 
3.38 across five markers including REN157J09 (surpassing the significance 
threshold of 3.2 for canine linkage studies (32) (Figure 9B), thereby linking 
10 MWHD LD to this region of CFA35. 

As discussed in Examplel, the inventors mapped the human EPM2B 
locus to chromosomal band 6p22 (15). Canine CFA35 is syntenic, in its 
entirety, to human 6p21.33-6p25.2 (31), and it was therefore possible that the 
same gene Is mutated in human EPM2B patients and MWHD LD cases. The 
15 inventors cloned human EPM2B first (33), used its single exon to probe the 
RPCI-81 canine bacterial artificial chromosome (BAG) library and isolated 
BAG 328A12, which upon sequencing using human EPM2B primers revealed 
the complete open reading frame of canine EPM2B (AY560905). Radiation 
hybrid mapping linked EPM2B to marker REN157J09 (LOD score 19.3). the 
20 same marker to which MWHD LD had linked (above) with maximum LOD 
score (Figure 9b). In aggregate, these data argued that EPM2B was a viable 
candidate for the MWHD disease. 

PGR amplification of the canine EPM2B gene initially failed in affected 
but not unaffected MWHD across a region in the 5' half of the gene. 
25 Examination of the normal sequence in this region revealed two consecutive 
identical dodecamers, and a third copy differing by a single nucleotide (the 
perfectly repeated copy is termed D and the third imperfect copy T, for ease of 
reference) (Figure 10a). The corresponding region in other sequenced 
species (Figure 10a) is not repetitive and the canine sequence is longer by 12 
30 nucleotides, the length of one D repeat (Figure 10a). Concerted modifications 
of PGR conditions to amplify and sequence across this region in affected 
animals ultimately succeeded and revealed the MWHD LD mutation: all 
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affected animals had bi-allelic expansions of the dodecamer repeat, with 1 9 to 
26 copies of the D sequence in each allele (Figures 9a, 10b). The expansions 
are in-frame and might permit protein, albeit abnormal, to be made if mRNA 
containing them are stable. 

5 The inventors next tested the effect of the expansion mutation on 

mRNA stability by comparing the amount of EPM2B mRNA in skeletal muscle 
from three affected dogs and two controls using quantitative RT-PCR 
(Supplementary Methods). EPM2B mRNA levels were more than 900 times 
reduced In the affected animals (Figure 10c). 

10 The extra D sequence characterizing canine EPM2B was initially 

detected in the BAG, of Doberman Pinscher origin, and in the normal MWHD, 
The inventors considered whether this is a feature of only some breeds of 
dog, predisposing them, at least in the case of the MWHD, to expansion and 
LD. The inventors sequenced EPM2B from two normal unrelated dogs from 

15 each of 128 different breeds. Sixty percent of chromosomes had three 
repeats and 40% two repeats. Alleles with three repeats consisted always of 
two D's and one T and those with two repeats of one D and one T. Almost all 
breeds had examples of both variants, in homozygous or heterozygous state. 
Inheritance, tested in three four-generation families of different breeds, was 

20 Mendelian. 

Having shown that the dodecamer repeat is present across the canine 
species, the inventors considered whether its expansion is a cause of 
myoclonic seizures in other breeds of dog. The inventors tested the next non- 
MWHD PME case to present to the clinic, a Bassett Hound, and found a 

25 homozygous seven-fold expansion (14 copies) of the D repeats (Figure 1 0b). 

The sequence of the dodecamer expansion mutation is composed 
exclusively of G and C nucleotides. In the presence of the normal allele, PGR 
of the expanded allele was impossible, and hence carriers could not be 
determined. To detect carriers, the inventors deaminated the DNA, converting 

30 unmethyiated cytosines to uracils, and then PGR-amplified. which allowed 
amplification of the mutant allele, and reliable detection of carrier MWHD 
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(Figure 3b). Deamination also greatly facilitated PGR amplification and 
diagnosis of affected animals. 

The deamination method employed above was originally developed 
(34) for the amplification of an until now unique dodecamer repeat expansion 

5 mutation arising from a two to three-copy dodecamer repeat sequence 
specific to the human cystatin-B gene promoter and causing EPM1, another 
form of PME, by impeding cystatin-B expression (35). In humans, the more 
than 60 mutations in the two LD genes {EPM2A and EPM2B) 
(http://projects.tcag.ca/lafora/) are not the most common cause of PME. 

10 Instead, the single, recurring, cystatin-B promoter expansion mutation causes 
most human PME. To the inventors' knowledge, EPM1 has never been 
described in dogs, likely because they do not have the cystatin-B dodecamer 
repeat. LD, on the other hand is regularly reported in dogs (Basset Hounds 
(36,37,38), Miniature (36) and Standard (39) Poodles, Pointers (40), Corgis 

15 (41), Beagles (42,43,44) and MWHD (45,46). This is likely due to recurrent 
expansion events of the EPM2B repeat, combined with the intensive 
inbreeding so prevalent in the domesticated dog. In other domesticated 
animals LD is exceedingly rare: two cows identified during mass testing for 
bovine spongiform encephalopathy (47) (which clinically resembles LD) and 

20 one cat (48). 

Presence of the dodecamer repeat across canine breed barriers (49) 
suggests that its origin predates dogs and that it might therefore be present in 
related species. The inventors determined the distribution of the repeat in the 
order Carnivore and show that it is specific to the Canidae family (wolves, 

25 dogs, foxes, coyotes and jackals) (Figure lOd). The inventors were also able 
to discern the repeat's evolution, which occurred in two steps: appearance of 
the D sequence and then its duplication. Carnivores separated into Felidae 
and Canoidae in the Paleocene -60 million years ago (Ma) (50,51). The D 
sequence is present across extant Canoidae but not Felidae (Figure 10d). It 

30 therefore first appeared after the feiine-canoid split and before the subsequent 
divergence of canoids into Arctoidae and Canidae ~50Ma (Figure lOd). 
Arctoids flourished on the Eurasian continent where they evolved into extant 
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bears, skunks, racoons, otters, etc. (50). These species have a single copy of 
the D sequence (Figure 10d). Canids occupied the equivalent niches on the 
North American landmass (successfully defending it against arctoid incursions 
for 40 million years) (50), and evolved by ~10Ma to present-day wolves, dogs. 

5 foxes, coyotes and jackals (51), all of which have the duplicated D sequence 
(Figure 10d). Duplication of the D sequence therefore occurred in the canid 
evolutionary line, sometime between 50Ma and 10Ma. Whether the resulting 
variable four-amino acid lengthening of the middle portion of malin (Figure 
10a) confers a property to this E3 ligase advantageous to canids remains to 

10 be seen. 

Epilepsy is the most common neurological disorder of dogs. Its 
prevalence is at least five times higher than in human (52). Here, the 
inventors have discovered the first canine epilepsy mutation. To what extent 
it contributes to other forms of canine epilepsy remains to be determined. 

15 In the United Kingdom, LD is presently widespread in the MWHD, a 

breed with -800 dogs registered each year. Through popular carrier sires 
and repeatedly bred carrier champions the disease has spread to South Africa 
and North America (data not shown). In the United States, different 
dachshund coat types are considered varieties of the same breed and are 

20 intercrossed, which might spread the disease throughout the dachshund. The 
mutation detection test resulting from the work described here should instead 
allow its eradication through controlled breeding. Meanwhile, these animals 
are loved pets and receive neurologic care. They far outnumber human LD 
patients followed in any one medical centre. This is affording the inventors 

25 important experience with best treatments for tinis disease, which the 
inventors are applying to human patients. 

While the present invention has been described with reference to what 
are presently considered to be the preferred examples, it is to be understood 
that the invention is not limited to the disclosed examples. To the contrary, the 

30 invention is intended to cover various modifications and equivalent 
arrangements included within the spirit and scope of the appended claims. 
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All publications, patents and patent applications are herein 
incorporated by reference in their entirety to the same extent as if each 
individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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Table 1. £/>M2£ mutations identified in LD patients. 



F&miiy 


Origin 


Nucleotide change 


Predicted Effect 


LD6 


F-Canada 


Homozygous 76T-^A 


C26S (missense RING finger) 


LD7 


F-Canada 


Homozygous 76T— 


C26S (missense RING finger) 


LD27 


F-Canada 


Homozygous 76T->A 


C26S (missense RING finger) 


LD28 


F-Canada 


Homozygous 76T— >A 


C26S (missense RING finger) 


LD23102 


Brazil 


Homozygous 205C->G 


P69A (missense RING finger) 


LDlafl 00 


Brazil 


Homozvffous 205C-^G 


P69A (missense RING finger) 


LD5922 


Italy 


Homozygous 205C-^G 


P69A (missense RING finger) 




Italy 


Heterozygous zvdk^—tkj 
838G-^A 


P69A Tmissense RING fineer') 
E279K (missense NHL 4) 


L^L/*r 1 O i O 


Spain 


Heterozygous 2U5C-^u 


P6QA fmissense RING fineer^ 
G158fsl73 (fi-ameshift) 


XmlLJJ J 1 O V 


F-Canada 


Heterozygous zUDi^— 
204delC 


P69A (missense RING finger) 
P69fs21 (frameshifl RING finger) 




United States 


Heterozygous ^UD<-— 
468-469delAG 


P69A f missense RING finger) 

G158fsl6(frameshift) 


T n'^4477 


India 


Heterozygous 468-469delAG 

676C->T 


G158fsl6 (frameshifl) 
Q226X (nonsense NHL 3) 


T nsi 


Brazil 


Homozygous 468-469delAG 


G158fs 16 (frameshifl) 




Italy 


Homozygous 468-469delAG 


G158fsl6(ft'ameshifl) 




Ynjyftslavia 


Heterozygous 992delG 
468-469delAG 


G321fs2 (fi-ameshift NHL 5) 
G158fsl6(frameshift) 




Yueoslavia 


Heterozygous 992delG 
468-469delAG 


G321fs2 (fi-ameshift NHL 5) 
G158fsl6(fi-ameshifl) 




Bosnia 


Homozygous 992delG 


G321fs2 (frameshift NHL 5) 


LD38324 


Yugoslavia 


Homozygous 1048-1049delGA 


E340fs40 (frameshift NHL 5) 


l^JL/ f\jJJ 


Israel 


Homozygous 373-382dellObp 


T125fsl03 (frameshift) 


LD628 


Italy 


Homozygous 661-692del32bp 


V16fsl (frameshift NHL 3) 


LD483 


Italy 


Homozygous 260T-^C 


L87P (missense) 


LD22830 


Canada 


Homozygous 905A->C 


Q29P (missense NHL 4) 


LD32817 


Pakistan 


Homozygous 98T-->C 


F33S (missense RING finger) 


LD25-9 


Saudi Arabia 


Homozygous 892ins2T 


^908fQl ^ f frameshift NHL 4) 


LD5487 


Denmark 


Heterozygous 436G->A 
nOOdelT 


D146N (missense NHL 1) 
V362fs20 (frameshift) 


LD5489 


Denmark 


Homozygous llOOdelT 


V362fs20 (frameshift) 


Laf 25 


Italian 


606delT Homozygous 


F204fs27 


Laf41 


Italian 


923A-4'T Homozygous 


D308V 


3422 


Sudanese 


580G->T Homozygous 


G194C 


Laf44 


1 Italian 


199G-)'T Homozygous 


E67X 
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